CANADIAN JOURNAL OF MATHEMATICS


Journal Canadien de Mathématiques 


VOL. IV - NO. 1 
1952 


Foreword

Sur un problème de configurations et sur les fractions continues    Jacques Touchard
Zeta functions on the unitary sphere    S. Minakshisundaram
The homomorphic mapping of certain matric algebras onto rings of diagonal matrices    J. K. Goldhaber
Contributions to noncommutative ideal theory    D. C. Murdoch
Note on normal decimals    H. Davenport and P. Erdös
On products of sets of group elements    H. B. Mann
The Fourier coefficients of the modular function λ(τ)    William H. Simons
Axioms for elliptic geometry    David Gans
On the geometry of lineal elements on a sphere, Euclidean kinematics, and elliptic geometry    J. M. Feld
On the property C and a problem of Hausdorff    Fritz Rothberger
A remark on the existence of a denumerable base for a family of functions    Fritz Rothberger
An extension of Meyer's theorem on indefinite ternary quadratic forms    Burton W. Jones


Published for 
THE CANADIAN MATHEMATICAL CONGRESS 
by the 


University of Toronto Press 





EDITORIAL BOARD 


H. S. M. Coxeter, A. Gauthier, R. D. James, R. L. Jeffrey, 
G. de B. Robinson, H. Zassenhaus 


with the co-operation of 


A. S. Besicovitch, R. Brauer, D. B. DeLury, P. A. M. Dirac, 
R. Godement, I. Halperin, L. Infeld, S. MacLane, G. Pall, 
L. Schwartz, J. L. Synge, W. J. Webber 


The chief languages of the Journal are English and French. 


Manuscripts for publication in the Journal should be sent to the 
Editor-in-Chief, H. S. M. Coxeter, University of Toronto. Every paper 
should contain an introduction summarizing the results as far as possible 
in such a way as to be understood by the non-expert. 


All other correspondence should be addressed to the Managing 
Editor, G. de B. Robinson, University of Toronto. 


The Journal is published quarterly. Subscriptions should be sent 
to the Managing Editor. The price per volume of four numbers is 
$6.00. This is reduced to $3.00 for individual members of the 
following Societies: 


Canadian Mathematical Congress 
American Mathematical Society 
Mathematical Association of America 
London Mathematical Society 

Société Mathématique de France 


The Canadian Mathematical Congress gratefully acknowledges the 
assistance of the following towards the cost of publishing this Journal: 


University of British Columbia
Carleton College
École Polytechnique
Université Laval
Loyola College
University of Manitoba
McGill University
McMaster University
Université de Montréal
Queen's University
Royal Military College
University of Toronto
National Research Council of Canada
and the
American Mathematical Society


AUTHORIZED AS SECOND CLASS MAIL, POST OFFICE DEPARTMENT, OTTAWA 





Foreword to Volume IV 


For the past six months the University of Toronto Press has been experimenting with a new method of setting mathematical formulae, with a view to reducing the hand work involved. The reader will notice that in the first five papers in this issue of the Journal the indices of the first order are too small, while in the remaining papers this fault is corrected and the indices are larger; this change marks a notable improvement in existing practice. It may be of interest to note that only the large brackets and braces and the large integral, product, and summation signs are now inserted by hand; the limits are set in place on the machine. The Editors of the Journal are much pleased with the advantages of the new system. The time and cost of setting are both materially reduced and the alignment of first and second order indices is improved. It is hoped to bring out shortly a pamphlet of Instructions to Authors which will explain the new system and offer advice on the preparation of manuscripts.











SUR UN PROBLEME DE CONFIGURATIONS ET SUR 
LES FRACTIONS CONTINUES 


JACQUES TOUCHARD 


Introduction. In a previous article [6, §4] I attempted to treat the following problem, which is the subject of the present work: $2n$ abscissae, marked $1, 2, \dots, 2n$ from left to right, are given on a horizontal axis. They are joined in pairs by $n$ convex arcs, drawn above the axis, in such a way that each abscissa is the origin or the endpoint of exactly one arc, the origin being on the left and the endpoint on the right. One thus obtains $p_n = 1\cdot 3\cdot 5 \cdots (2n-1)$ configurations, and one asks for the number of those which have $p$ double points.
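The counting problem stated above can be explored by brute force for small $n$. The following Python sketch is not part of the original article; the function names `matchings`, `crossings` and `crossing_distribution` are hypothetical, and a "double point" is taken to be a pair of arcs $(a, b)$, $(c, d)$ with $a < c < b < d$.

```python
from itertools import combinations

def matchings(points):
    """Yield every way of joining the sorted tuple `points` in pairs, as arcs (i, j) with i < j."""
    if not points:
        yield []
        return
    first, rest = points[0], points[1:]
    for k, partner in enumerate(rest):
        remaining = rest[:k] + rest[k + 1:]
        for tail in matchings(remaining):
            yield [(first, partner)] + tail

def crossings(arcs):
    """Count the double points: pairs of arcs (a, b), (c, d) whose endpoints interleave."""
    total = 0
    for (a, b), (c, d) in combinations(arcs, 2):
        if a < c < b < d or c < a < d < b:
            total += 1
    return total

def crossing_distribution(n):
    """Return [t_0, t_1, ...], t_p being the number of configurations of n arcs with p double points."""
    counts = [0] * (n * (n - 1) // 2 + 1)
    for arcs in matchings(tuple(range(1, 2 * n + 1))):
        counts[crossings(arcs)] += 1
    return counts

print(crossing_distribution(3))   # -> [5, 6, 3, 1], which sums to 1*3*5 = 15
```

The enumeration grows like $1\cdot 3\cdot 5\cdots(2n-1)$, so it is only a check on small cases, not a substitute for the formulas developed in the article.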

I believe I should recall the following definitions. We shall say that two arcs $C_i$ and $C_j$ belong to the same system if one covers the other, or if one cuts the other, or if a third arc $C_k$ covers $C_i$ and $C_j$ or cuts them both, or else cuts one of them and covers the other. When a system $S_1$ is covered by an arc of a system $S_2$ and no arc of $S_1$ is cut by any arc of $S_2$, $S_1$ and $S_2$ form a system $S$ of which $S_1$ is a subsystem. We shall say that a system is proper when it contains no subsystem.

I had approached this problem starting from the notion of proper systems, which are indeed the elements into which every configuration decomposes. I thus arrived at general formulas giving the numbers of configurations which have from zero to six double points, but the difficulty of forming the proper systems had prevented me from going further. At the end of the article in question I indicated the principle of another method. It consists in representing a figure having $p$ double points by $x^p$. The set of configurations of $n$ arcs is thus represented by a polynomial $T_n(x)$, in which the coefficient of $x^p$ is the number of those which have $p$ double points. In the same way, the configurations of $n$ arcs forming a single system are represented by a polynomial $S_n(x)$, and those which form a proper system by a polynomial $P_n(x)$. The determination of $S_n(x)$ is the most direct, and I obtain it in §§2, 3, 4 by doing, all in all, no more than generalizing the properties of an arithmetical triangle, known as Delannoy's triangle, which I recall in §1. Just as the Delannoy numbers can be generated by a continued fraction, so the generating function of the polynomials $S_n(x)$ is a continued fraction $F(x, z)$, which I study in §§5, 6 and 7. I then determine, in §§8, 9 and 10, the polynomials $T_n(x)$ and $P_n(x)$, together with certain numerical values. It will be seen that there is a beautiful reciprocity between the polynomials $P_n(x)$ and $S_n(x)$, and it is hard to explain why the determination of the proper systems appears so difficult, whereas that of the systems, proper or improper, is easy. §11 contains some identities involving certain functions in common use in the theory of elliptic functions. In §12 I make the connection between the method of my previous article [6] and the one employed here. This verification was necessary, because the calculations of my article [6] rested on the consideration of rather numerous figures which I had not reproduced and which could lead to errors. The definitive expression for the number of configurations having a given number of double points will not be found here; one could no doubt arrive at it, but only, we believe, by means of complicated and unwieldy formulas. The continued fraction $F(x, z)$ is interesting in itself, and it gives, on the expansion of certain continued fractions, a general result which is the subject of §13. As the determination of the value of $F(x, z)$ is far from immediate, I had been led, in the course of various attempts, to form the partial derivatives of a continued fraction with respect to its elements. Although these are very easy to obtain, they do not appear, to my knowledge, in any work. The formulas which I give in §14 seemed to me to deserve to be known. §15 contains numerical tables.

Received November 22, 1950.
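Following Perron [3], and to save space, I have represented

$$b_0 + \cfrac{a_1}{b_1 + \cfrac{a_2}{b_2 + \cdots}} \quad\text{by}\quad b_0 + \frac{a_1|}{|b_1} + \frac{a_2|}{|b_2} + \cdots$$

and

$$b_0 - \cfrac{a_1}{b_1 - \cfrac{a_2}{b_2 - \cdots}} \quad\text{by}\quad b_0 - \frac{a_1|}{|b_1} - \frac{a_2|}{|b_2} - \cdots.$$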


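1. Delannoy's triangle $D_p^q$ [1] is the following:

p \ q    0    1    2    3    4    5
0        1
1        1    1
2        1    2    2
3        1    3    5    5
4        1    4    9   14   14
5        1    5   14   28   42   42

It is defined by

$$D_p^q = D_{p-1}^q + D_p^{q-1}, \qquad D_p^0 = 1, \tag{1}$$

with $D_p^q = 0$ for $q > p$. One has

$$D_p^p = D_p^{p-1} = D_{p-1}^{p-1} + D_{p-1}^{p-2} + \dots + D_{p-1}^{0},$$

$$D_p^q = \frac{p-q+1}{p+1}\binom{p+q}{q},$$

$$D_n^n = D_0^0 D_{n-1}^{n-1} + \dots + D_i^i D_{n-i-1}^{n-i-1} + \dots + D_{n-1}^{n-1} D_0^0. \tag{2}$$

Throughout this article, $\varphi(z)$ will denote the function

$$\varphi(z) = \frac{1 - (1 - 4z)^{1/2}}{2z}, \tag{3}$$

for which

$$z\varphi^2(z) - \varphi(z) + 1 = 0, \tag{4}$$

$$\varphi(z) = D_0^0 + D_1^1 z + \dots + D_n^n z^n + \cdots, \tag{5}$$

$$[\varphi(z)]^{p+1} = D_p^0 + D_{p+1}^1 z + \dots + D_{p+n}^n z^n + \cdots, \tag{6}$$

and one has, in the form of a continued fraction,

$$\varphi(z) = \frac{1|}{|1} - \frac{z|}{|1} - \frac{z|}{|1} - \cdots. \tag{7}$$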


2. To determine $S_n(x)$, we shall start from the following remark. When one has a system of $n$ arcs $C_1, C_2, \dots, C_n$, whose respective origins, counted from left to right, are $y_1, y_2, \dots, y_n$, if one removes the last arc $C_n$, the remaining figure is still a system. For, if two systems remained, either $C_n$ would cut only one of them and the original figure would comprise two systems, or else $C_n$ would cut both of them and then $C_n$ would not be the last arc: at least one arc would have its origin to the right of $y_n$. To form the systems of $n$ arcs, one can therefore start from the systems of $n - 1$ arcs, with origins $y_1, y_2, \dots, y_{n-1}$, and add an $n$-th arc with origin $y_n$.

What are the origins of the arcs in a system? Evidently $y_1 = 1$. One has $y_2 = 2$, for otherwise $C_1$ would form a system by itself. One has $y_3 = 3$ or $4$, for if one had $y_3 \ge 5$, $C_1$ and $C_2$ would between them form at least one system. In general, $y_k = k, k+1, k+2, \dots$, or $2k - 2$, for if one had $y_k \ge 2k - 1$, the arcs $C_1, C_2, \dots, C_{k-1}$ would by themselves form at least one system of $k - 1$ arcs and the set $C_1, C_2, \dots, C_k, \dots$ would form at least two systems. One can therefore draw up tableaux, which I shall call tableaux $\Omega_n$, for the origins of the arcs of a system of $n$ arcs.

$\Omega_1$: 1.
$\Omega_2$: 1 2.
$\Omega_3$: 1 2 3, 1 2 4.
$\Omega_4$: 1 2 3 4, 1 2 3 5, 1 2 3 6, 1 2 4 5, 1 2 4 6.
$\Omega_5$: 1 2 3 4 5, 1 2 3 4 6, 1 2 3 4 7, 1 2 3 4 8, 1 2 3 5 6, 1 2 3 5 7, 1 2 3 5 8, 1 2 3 6 7, 1 2 3 6 8, 1 2 4 5 6, 1 2 4 5 7, 1 2 4 5 8, 1 2 4 6 7, 1 2 4 6 8.

To form the tableau $\Omega_{n+1}$, one takes each combination of the tableau $\Omega_n$; let $y_1 y_2 \cdots y_n$ be one of them; to the right of $y_n$ one writes successively $y_n + 1, y_n + 2, \dots, 2n - 1, 2n$.

One can moreover form the tableau $\Omega_n$ directly from the system of inequalities

$$y_1 = 1;\; y_2 = 2;\; \dots;\; k \le y_k \le 2k - 2;\; \dots;\; n \le y_n \le 2n - 2; \qquad y_1 < y_2 < y_3 < \dots < y_n. \tag{8}$$
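The construction of the tableaux $\Omega_n$ can be sketched in a few lines of Python. This is an illustration only, not the author's procedure verbatim; the function name `omega` is hypothetical. It extends each row by every admissible $y_k$ with $y_{k-1} < y_k \le 2k - 2$, as in the inequalities (8).

```python
def omega(n):
    """Return the tableau Omega_n as a list of tuples (y_1, ..., y_n) satisfying (8)."""
    if n == 1:
        return [(1,)]
    rows = [(1, 2)]
    for k in range(3, n + 1):
        # y_k runs over y_{k-1} + 1, ..., 2k - 2 (the lower bound y_k >= k then holds automatically)
        rows = [row + (y,) for row in rows for y in range(row[-1] + 1, 2 * k - 1)]
    return rows

for n in range(1, 6):
    print(n, len(omega(n)))
print(omega(4))
```

The row counts obtained this way are 1, 1, 2, 5, 14 for $n = 1, \dots, 5$, in agreement with the tableaux listed above.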




Here now is the way to obtain the functions $S_n(x)$ representing the configurations of $n$ arcs $C_1, C_2, \dots, C_n$ forming only a single system. We shall set it out in detail in order to be perfectly clear. We shall put, in the rest of this study, up to and including §12,

$$a_n = 1 + x + x^2 + \dots + x^{n-1}. \tag{9}$$

For a single arc, one has $S_1(x) = 1 = a_1$. For two arcs, $y_2 = 2$; $C_2$ may or may not cut $C_1$, which gives the term $1 + x = a_2$ and

$$S_2(x) = a_1 a_2.$$

For three arcs, $y_3 = 3$ or $4$. If $y_3 = 3$, $C_3$ can cut 0, 1 or 2 of the arcs $C_1$ and $C_2$, which gives the factor $1 + x + x^2 = a_3$. If $y_3 = 4$, $C_3$ can cut 0 or 1 arc, which gives the factor $1 + x = a_2$, so that

$$S_3(x) = a_1 a_2 a_3 + a_1 a_2 a_2.$$

One sees that $S_3(x)$ is formed of two monomials; in the first, the last letter on the right is $a_3$, which expresses that $y_3 = 3$; in the second, the last letter on the right is $a_2$, which expresses that $y_3 = 4$.

Let us also form $S_4(x)$. If $y_4 = 4$, which can happen only if $y_3 = 3$, $C_4$ can cut 0, 1, 2 or 3 of the arcs $C_1, C_2, C_3$, which gives the factor $1 + x + x^2 + x^3 = a_4$, by which the monomial $a_1 a_2 a_3$ must be multiplied. If $y_4 = 5$, which can happen if $y_3 = 3$ or $4$, $C_4$ can cut 0, 1 or 2 arcs, which gives the factor $1 + x + x^2 = a_3$, by which the two monomials $a_1 a_2 a_3$ and $a_1 a_2 a_2$ must be multiplied. Finally, if $y_4 = 6$, which can happen if $y_3 = 3$ or $4$, $C_4$ can cut 0 or 1 arc, which gives the factor $1 + x = a_2$, by which the two monomials $a_1 a_2 a_3$ and $a_1 a_2 a_2$ must be multiplied. One therefore has

$$S_4(x) = a_1 a_2 a_3 a_4 + a_1 a_2 a_3 a_3 + a_1 a_2 a_2 a_3 + a_1 a_2 a_3 a_2 + a_1 a_2 a_2 a_2.$$

The fact that the last letter on the right of a monomial is $a_4$ expresses that $y_4 = 4$; if this last letter is $a_3$, then $y_4 = 5$; if it is $a_2$, then $y_4 = 6$.

In general, $S_n(x)$ presents itself in the form of a sum of homogeneous monomials in $a_1, a_2, \dots, a_n$, which we shall denote by $R_n(a_1, a_2, \dots, a_n)$ or, more briefly, by $R_n(a)$, and we shall suppose that care has been taken to leave at the right of each monomial the letter which was written last, when $R_n(a)$ was formed from $R_{n-1}(a)$.

Let us form $S_{n+1}(x) = R_{n+1}(a)$ and, for this purpose, consider an arbitrary monomial of $R_n(a)$. Suppose that the last letter on the right is $a_n$. This fact expresses that $y_n = n$, and then $y_{n+1} = n + 1, n + 2, \dots$ or $2n$. If $y_{n+1} = n + q$, the arc $C_{n+1}$ can cut $0, 1, 2, \dots$ or $n - q + 1$ of the arcs $C_1, C_2, \dots, C_n$, whence the factor

$$1 + x + x^2 + \dots + x^{n-q+1} = a_{n-q+2}.$$

Thus, if $y_n = n$, the monomial must be multiplied successively by $a_{n+1}, a_n, a_{n-1}, \dots, a_3, a_2$. Suppose similarly that the last letter on the right is $a_{n-r}$; this fact expresses that $y_n = n + r$, and then $y_{n+1} = n + r + 1, n + r + 2, \dots, 2n$, which gives the factors $a_{n-r+1}, a_{n-r}, \dots, a_3, a_2$, by which the monomial must be multiplied successively. We have therefore established the following two rules:

First Rule. To form $R_{n+1}(a)$ from $R_n(a)$, one considers all the monomials of $R_n(a)$. If a monomial ends on the right with $a_i$, one multiplies it successively by $a_{i+1}, a_i, a_{i-1}, \dots, a_3, a_2$ and adds the results.

Second Rule. There is a one-to-one correspondence between the combinations of origins of the tableau $\Omega_n$ and the monomials of $R_n(a)$. This correspondence is the following: if the combination of origins is $1, 2, y_3, y_4, \dots, y_n$, the indices of the letters $a_i$ in the corresponding monomial will be, from left to right,

$$1,\; 2,\; 6 - y_3,\; 8 - y_4,\; 10 - y_5,\; \dots,\; 2n - y_n.$$

Let $y'_k = 2k - y_k$ be these indices, $k = 1, 2, \dots, n$; the system of inequalities (8) is transformed into the system

$$y'_1 = 1,\; y'_2 = 2,\; \dots,\; 2 \le y'_k \le k,\; \dots,\; 2 \le y'_n \le n; \qquad y'_2 \le y'_1 + 1,\; y'_3 \le y'_2 + 1,\; \dots,\; y'_n \le y'_{n-1} + 1, \tag{10}$$

which allows one to form all the monomials of $R_n(a)$ and, consequently, the function $R_n(a)$ itself. Substituting the expression (9) for $a_n$, one obtains $S_n(x)$ by calculations which rapidly become complicated.
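The first rule lends itself to a direct computation. The sketch below is an illustration, not the author's own procedure: polynomials in $x$ are stored as coefficient lists, and the helper names `poly_mul`, `poly_add`, `a`, `monomials` and `S` are hypothetical. Each monomial of $R_n(a)$ is represented by its sequence of indices, and a monomial ending in index $i$ is extended by $i+1, i, \dots, 2$.

```python
def poly_mul(p, q):
    """Product of two polynomials given as coefficient lists (lowest degree first)."""
    out = [0] * (len(p) + len(q) - 1)
    for i, pi in enumerate(p):
        for j, qj in enumerate(q):
            out[i + j] += pi * qj
    return out

def poly_add(p, q):
    """Sum of two polynomials given as coefficient lists."""
    n = max(len(p), len(q))
    return [(p[i] if i < len(p) else 0) + (q[i] if i < len(q) else 0) for i in range(n)]

def a(i):
    """a_i = 1 + x + ... + x^(i-1), formula (9)."""
    return [1] * i

def monomials(n):
    """Index sequences of the monomials of R_n(a), built with the first rule."""
    rows = [(1,)]
    for _ in range(n - 1):
        # a monomial ending in a_i is multiplied by a_{i+1}, a_i, ..., a_2
        rows = [row + (j,) for row in rows for j in range(row[-1] + 1, 1, -1)]
    return rows

def S(n):
    """S_n(x) as a coefficient list, obtained by substituting (9) into R_n(a)."""
    total = [0]
    for row in monomials(n):
        term = [1]
        for i in row:
            term = poly_mul(term, a(i))
        total = poly_add(total, term)
    return total

print(monomials(4))
print(S(3))   # -> [2, 4, 3, 1], i.e. 2 + 4x + 3x^2 + x^3
```

For $n = 4$ the index sequences produced are those of the five monomials of $S_4(x)$ written out above.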


3. These calculations can be given more regularity in the following way. By what precedes, $R_n(a)$ is a linear function of the last letters written at the right of each monomial. One can therefore put, for $n \ge 2$,

$$R_n(a) = C(n, n)a_n + C(n, n-1)a_{n-1} + \dots + C(n, i)a_i + \dots + C(n, 2)a_2. \tag{11}$$

Applying the first rule of §2, one finds

$$C(n+1, n+1) = C(n, n)a_n,$$

$$C(n+1, i) = C(n, n)a_n + C(n, n-1)a_{n-1} + \dots + C(n, i)a_i + C(n, i-1)a_{i-1}, \tag{12}$$

from which one deduces

$$\begin{aligned} C(n+1, n+1) &= C(n, n)a_n, \\ C(n+1, n) &= C(n+1, n+1) + C(n, n-1)a_{n-1}, \\ &\;\;\vdots \\ C(n+1, i) &= C(n+1, i+1) + C(n, i-1)a_{i-1}; \end{aligned} \tag{13}$$

one can therefore form a triangle

C(2, 2)
C(3, 3)   C(3, 2)
. . .
C(n, n)   C(n, n-1)   . . .   C(n, i)   . . .   C(n, 2),

whose law of formation is given by (13). From (13), one sees that $C(n, 2) = C(n, 3)$ and, setting $i = 3$ in (12) and comparing with (11), one has

$$C(n, 2) = C(n, 3) = R_{n-1}(a). \tag{14}$$

When one sets $x = 0$, one has $a_i = 1$ and one falls back on Delannoy's triangle, provided that one puts, for $x = 0$,

$$C(n, n-k) = D_{n-2}^{k}.$$

With the aid of relation (13), one can form numerical triangles for the coefficients of $x, x^2, x^3, \dots$ in the polynomials $C(n, k)$. One thus finds that the three terms of lowest degree in $S_n(x)$ are

$$D_{n-1}^{n-1} + (n-1)D_{n-1}^{n-1}\,x + \left[\binom{n}{2}D_{n-1}^{n-1} - D_n^{n-2}\right]x^2,$$

but the law of formation of these triangles becomes complicated very quickly and I have not pursued this path.
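The triangle of the $C(n, i)$ can be built row by row from (13). The following Python sketch is only an illustration under stated assumptions: the helper names are hypothetical, polynomials are coefficient lists, $C(n, 1)$ is treated as zero, and the closed form $D_p^q = \frac{p-q+1}{p+1}\binom{p+q}{q}$ of §1 is used to compare the three lowest coefficients for small $n$.

```python
from math import comb

def poly_mul(p, q):
    out = [0] * (len(p) + len(q) - 1)
    for i, pi in enumerate(p):
        for j, qj in enumerate(q):
            out[i + j] += pi * qj
    return out

def poly_add(p, q):
    n = max(len(p), len(q))
    return [(p[i] if i < len(p) else 0) + (q[i] if i < len(q) else 0) for i in range(n)]

def a(i):
    """a_i = 1 + x + ... + x^(i-1)."""
    return [1] * i

def triangle_row(n):
    """Return {i: C(n, i)} for 2 <= i <= n (n >= 2), using recurrence (13)."""
    row = {2: [1]}                                  # C(2, 2) = a_1 = 1
    for m in range(2, n):                           # build row m + 1 from row m
        new = {m + 1: poly_mul(row[m], a(m))}       # C(m+1, m+1) = C(m, m) a_m
        for i in range(m, 1, -1):
            prev = row.get(i - 1)                   # C(m, 1) is absent: treated as 0
            extra = poly_mul(prev, a(i - 1)) if prev is not None else [0]
            new[i] = poly_add(new[i + 1], extra)    # C(m+1, i) = C(m+1, i+1) + C(m, i-1) a_{i-1}
        row = new
    return row

def R(n):
    """R_n(a) = sum of C(n, i) a_i over i, formula (11), with (9) substituted."""
    total = [0]
    for i, coeff in triangle_row(n).items():
        total = poly_add(total, poly_mul(coeff, a(i)))
    return total

def D(p, q):
    """Delannoy's triangle of §1 in closed form."""
    return (p - q + 1) * comb(p + q, q) // (p + 1)

for n in range(3, 6):
    lowest = [D(n - 1, n - 1), (n - 1) * D(n - 1, n - 1), comb(n, 2) * D(n - 1, n - 1) - D(n, n - 2)]
    print(n, R(n)[:3], lowest)
```

Relation (14) can be read off the same data: the entry `triangle_row(n)[2]` coincides with `R(n - 1)`.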


4. An independent expression for the functions $C(n+1, r)$ can be obtained step by step with the aid of equation (13). In general, $C(n+1, r)$ is expressed by a multiple sum of order $n + 1 - r$, which we shall not write down, but we shall give, because of its importance, the expression of $C(n+1, 3) = R_n(a)$ by a sum of order $n - 2$, namely

$$R_n(a) = a_1 a_2 \sum_{i_1=2}^{3}\; \sum_{i_2=2}^{i_1+1} \cdots \sum_{i_{n-2}=2}^{i_{n-3}+1} a_{i_1} a_{i_2} \cdots a_{i_{n-2}}. \tag{15}$$

This is precisely the expression to which the system of inequalities (10) leads.

5. It is now a question of finding the analogue of the bilinear formula (2), relating to the numbers $D_p^q$. Now this is very easy if one refers to the tableau $\Omega_n$ of the origins of the arcs and to the second rule of §2. From the inequalities (8), it is evident that if one considers all the combinations of the tableau $\Omega_n$,

$$1\; 2\; y_3\; y_4\; \dots\; y_q\; y_{q+1}\; \dots\; y_n,$$

and divides them into two sections, one formed by the first $q$ digits, the other by the following $n - q$, the first section will comprise all the combinations relating to the $q$ arcs $C_1, C_2, \dots, C_q$ and the second all the combinations relating to the $n - q$ arcs $C_{q+1}, C_{q+2}, \dots, C_n$. One will therefore form the tableau $\Omega_n$ by associating:

the tableau for $C_1$ with the tableau for $C_2, C_3, \dots, C_n$,
the tableau for $C_1, C_2$ with the tableau for $C_3, \dots, C_n$,
the tableau for $C_1, C_2, C_3$ with the tableau for $C_4, \dots, C_n$,

and so on.

Thanks to the correspondence between the combinations of the tableaux $\Omega_n$ and the monomials of $R_n(a)$, this is expressed by the following property, which it suffices to set out on an example, indicating by a dash the division into two sections:

1-234     $a_1 - a_2 a_3 a_4$
1-235     $a_1 - a_2 a_3 a_3$
12-45     $a_1 a_2 - a_2 a_3$
123-6     $a_1 a_2 a_3 - a_2$
124-6     $a_1 a_2 a_2 - a_2$

where one sees, in the second column, that the groups of letters placed to the right of the dash are, taken together, the same as those to the left of the dash, but with indices increased by one unit. The proof is general, and if we denote by $R_n(a, 1)$ the function $R_n(a)$ in which all the indices of the $a_i$ have been increased by one unit, we shall have

$$R_n(a) = R_1(a)R_{n-1}(a, 1) + R_2(a)R_{n-2}(a, 1) + \dots + R_{n-1}(a)R_1(a, 1). \tag{16}$$

Let us put

$$F(x, z) = R_1(a) + R_2(a)z + \dots + R_n(a)z^{n-1} + \cdots, \tag{17}$$

and let $F_1(x, z)$ be the function $F$ in which all the indices have been increased by one unit, so that

$$F_1(x, z) = R_1(a, 1) + R_2(a, 1)z + \dots + R_n(a, 1)z^{n-1} + \cdots;$$

we shall have, by (16),

$$zF(x, z)F_1(x, z) - F(x, z) + a_1 = 0. \tag{18}$$

Let then $F_k(x, z)$ be the function $F(x, z)$ in which all the indices have been increased by $k$ units; one will have similarly

$$zF_kF_{k+1} - F_k + a_{k+1} = 0 \tag{19}$$

and, consequently, in the form of a continued fraction,

$$F(x, z) = \frac{a_1|}{|1} - \frac{a_2 z|}{|1} - \frac{a_3 z|}{|1} - \dots - \frac{a_n z|}{|1} - \cdots, \tag{20}$$


where I recall that

$$F(x, z) = S_1(x) + S_2(x)z + \dots + S_n(x)z^{n-1} + \cdots.$$

When $|x| < 1$, the series (17) and the continued fraction (20) converge, [7, p. 45] and [3, p. 258], for $4|z| < |1 - x|$. For $x = 0$, equations (17), (18), and (20) reduce respectively to equations (5), (4), and (7), but one can also generalize equation (6) with the aid of the polynomials $C(p, q)$ of §3. Let, in fact,

$$\gamma_q(z) = C(q, q) + C(q+1, q)z + \dots + C(q+n, q)z^n + \cdots.$$

By (14), $\gamma_2(z) = F(x, z)$, and one will find, with the aid of formulas (13) and (19), that $\gamma_q(z) = FF_1F_2 \cdots F_{q-2}$, which, for $x = 0$, $q = p + 2$, reduces to the left-hand side of equation (6).
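The agreement between the continued fraction (20) and the polynomials $S_n(x)$ can be checked numerically at integer values of $x$. The sketch below is an illustration only; the names `cf_series` and `S_values` are hypothetical. It expands (20) with the relation $F_k = a_{k+1}/(1 - zF_{k+1})$ from (19), truncating the fraction at a depth sufficient for the number of coefficients requested, and compares the result with the values given by the first rule of §2.

```python
def a(i, x):
    """Value of a_i = 1 + x + ... + x^(i-1) at the number x."""
    return sum(x ** k for k in range(i))

def cf_series(x, terms):
    """First `terms` Taylor coefficients in z of the continued fraction (20) at the integer x."""
    # Innermost level kept: F_terms is replaced by its constant term a_{terms+1}.
    F = [a(terms + 1, x)] + [0] * (terms - 1)
    for k in range(terms - 1, -1, -1):
        den = [1] + [-c for c in F[:terms - 1]]          # 1 - z F_{k+1}
        inv = [1] + [0] * (terms - 1)                    # power series inverse of den
        for m in range(1, terms):
            inv[m] = -sum(den[j] * inv[m - j] for j in range(1, m + 1))
        F = [a(k + 1, x) * c for c in inv]               # F_k = a_{k+1} / (1 - z F_{k+1})
    return F

def S_values(x, terms):
    """S_1(x), ..., S_terms(x) from the first rule, grouping monomials by their last index."""
    last = {1: 1}                   # last index -> sum of the monomials ending with that index
    out = [1]
    for _ in range(terms - 1):
        new = {}
        for i, value in last.items():
            for j in range(2, i + 2):                    # multiply by a_2, ..., a_{i+1}
                new[j] = new.get(j, 0) + value * a(j, x)
        last = new
        out.append(sum(last.values()))
    return out

print(cf_series(0, 6))    # at x = 0 the coefficients of (5) reappear
print(cf_series(2, 5), S_values(2, 5))
```

At $x = 0$ every $a_i$ equals 1, so the expansion reproduces the coefficients of $\varphi(z)$ in (5), as the text states.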

Here is another remark. By relation (16), $R_n(a, 1)$ is expressed algebraically by means of $R_1(a), \dots, R_{n+1}(a)$ and, conversely, $R_n(a)$ is expressed algebraically by means of $R_1(a, 1), R_2(a, 1), \dots, R_{n-1}(a, 1)$. Now, if one forms the functions $R_n(a, 1)$, one will see that $R_{n-1}(a, 1)$ represents the configurations of $n$ arcs $C_1, C_2, \dots, C_n$ forming only a single system, but whose origins, instead of satisfying the inequalities (8), are subject to the system of inequalities:

$$y_1 = 1,\; y_2 = 2,\; y_3 = 3,\; \dots,\; k \le y_k \le 2k - 3,\; \dots,\; n \le y_n \le 2n - 3; \qquad y_1 < y_2 < y_3 < \dots < y_n.$$

There are therefore algebraic relations between the functions representing these particular systems and those which represent the most general systems.


6. Denote by

$$\frac{A_n}{B_n} = \frac{a_1|}{|1} - \frac{a_2 z|}{|1} - \dots - \frac{a_n z|}{|1}$$

the convergents of the continued fraction (20), which are functions of $x$ and $z$. They are obtained with the aid of the well-known recurrence formulas, and the Taylor series expansion of $A_n/B_n$ coincides with the expansion (17) up to and including the term $R_n(a)z^{n-1}$. Moreover, when the continued fraction converges, one can write

$$F(x, z) = \frac{A_n}{B_n} + \frac{a_1 a_2 \cdots a_{n+1} z^n}{B_n B_{n+1}} + \frac{a_1 a_2 \cdots a_{n+2} z^{n+1}}{B_{n+1} B_{n+2}} + \cdots \tag{21}$$

or

$$\frac{A_n}{B_n} = F(x, z) - \frac{a_1 a_2 \cdots a_{n+1} z^n}{B_n B_{n+1}} - \cdots.$$

Now, one easily finds that

$$\frac{1}{B_n B_{n+1}} = 1 + (2a_2 + 2a_3 + \dots + 2a_n + a_{n+1})z + z^2\epsilon(z),$$

$\epsilon(z)$ denoting a power series which does not vanish with $z$. Substituting in (21), one will see that, in the expansion of $A_n/B_n$, the coefficient of $z^n$ is $R_{n+1}(a) - a_1 a_2 \cdots a_{n+1}$ and that the coefficient of $z^{n+1}$ is

$$R_{n+2}(a) - a_1 a_2 \cdots a_{n+2} - a_1 a_2 \cdots a_{n+1}(2a_2 + 2a_3 + \dots + 2a_n + a_{n+1}).$$

Suppose now that $x$ is a primitive root, $x = \rho_k$, of $a_k = 0$, that is, of $x^k - 1 = 0$. One then has

$$F(\rho_k, z) = A_{k-1}(\rho_k, z)/B_{k-1}(\rho_k, z);$$

$F(\rho_k, z)$ is therefore a rational function of $z$ which is easy to form for $k = 2, 3, 4$. One thus obtains relations which are valuable for checking the accuracy of the coefficients which appear in the polynomials $S_n(x)$. One has $F(-1, z) = 1$, hence $S_n(-1) = 0$, $n \ge 2$. Next, $j$ being a complex cube root of unity, one has

$$F(j, z) = \frac{1}{1 + j^2 z} = \frac{1 - j^2 z + j z^2}{1 + z^3},$$

hence

$$S_n(j) = (-1)^{n-1} j^{2n-2}, \qquad n \ge 1.$$

Finally, for $i = \sqrt{-1}$,

$$F(i, z) = \frac{1 - iz}{1 - (1 + 2i)z},$$

hence, on expanding,

$$S_n(i) = (1 + i)(1 + 2i)^{n-2}, \qquad n \ge 2.$$
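The checks at roots of unity described in this section can be reproduced in floating-point arithmetic. The following Python sketch is illustrative only; the function names are hypothetical, and the closed forms tested are the ones obtained by terminating the continued fraction (20) at the first vanishing $a_k$, namely $S_n(-1) = 0$ for $n \ge 2$, $S_n(j) = (-j^2)^{n-1}$ and $S_n(i) = (1+i)(1+2i)^{n-2}$ for $n \ge 2$.

```python
import cmath

def a(i, x):
    """Value of a_i = 1 + x + ... + x^(i-1) at the (possibly complex) number x."""
    return sum(x ** k for k in range(i))

def S_values(x, terms):
    """S_1(x), ..., S_terms(x) from the first rule of §2, grouping monomials by last index."""
    last = {1: 1}
    out = [1]
    for _ in range(terms - 1):
        new = {}
        for i, value in last.items():
            for m in range(2, i + 2):
                new[m] = new.get(m, 0) + value * a(m, x)
        last = new
        out.append(sum(last.values()))
    return out

def close(u, v, tol=1e-9):
    return abs(u - v) < tol

def roots_of_unity_checks(terms):
    """Return True when the three families of values agree with the closed forms."""
    ok = all(close(v, 0) for v in S_values(-1, terms)[1:])
    w = cmath.exp(2j * cmath.pi / 3)                 # a primitive cube root of unity
    ok = ok and all(close(v, (-w * w) ** (n - 1))
                    for n, v in enumerate(S_values(w, terms), start=1))
    ok = ok and all(close(v, (1 + 1j) * (1 + 2j) ** (n - 2))
                    for n, v in enumerate(S_values(1j, terms), start=1) if n >= 2)
    return ok

print(roots_of_unity_checks(8))
```

Such evaluations are cheap, which is presumably why the text recommends them as a safeguard when the coefficients of $S_n(x)$ are computed by hand.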
7. It now remains to find the value of the continued fraction (20). A first indication is obtained when, in (20), one sets $x = 1$ and replaces $z$ by $-z$. One then has, formally,

$$F(1, -z) = \frac{1|}{|1} + \frac{2z|}{|1} + \frac{3z|}{|1} + \frac{4z|}{|1} + \cdots.$$

This is a special case of Gauss's continued fraction. One finds in fact that

$$1 + zF(1, -z) = 1/(1 - z + 1\cdot 3\,z^2 - 1\cdot 3\cdot 5\,z^3 + 1\cdot 3\cdot 5\cdot 7\,z^4 - \cdots), \tag{22}$$

a formula which will be of use to us later. The divergence of the series in the denominator is of no consequence here; it can be replaced by a finite number of terms, followed by a remainder. If one prefers,

$$\frac{1}{1 + zF(1, -z)} = \frac{1}{\sqrt{\pi}} \int_0^\infty \frac{e^{-u}u^{-1/2}\,du}{1 + 2zu}$$

and the integral can be expanded with a remainder. This being so, it is known that Heine [2, p. 284] generalized Gauss's series. It was therefore natural to try to use Heine's continued fractions, but all that I was able to obtain in this direction is an interesting identity, as follows. Let


X(x,z) = 1+ a,2 + a,a,2° + a,a,a,2° + 


a, ayant sa signification habituelle (9); on a 


_ 1 =~ a,z| Xxa,2 x°a,2| x*a.2 
(x, 2) [1 | 1 1 7 








be 


4 


ou, sauf un terme constant et un facteur z, on reconnait la fraction continue (20), 
dans laquelle on aurait remplacé, pour toute valeur de I’indice i, a; par x*~‘a,. 
Si on désigne, d’une facon abrégée, par R,(x*~*a,;) ce que devient alors le poly- 
néme R,(a), on aura 


+ pee 
X(x,2z) 


d’ot l’identité annoncée 


1 — R,(x*a;)2 —... — R,(x*“a;)2" —... 


R,(x*~*a;) + a,R,-,(x""*a;) + a,a,R,_,(x*~*a;) +... 
+ 4,0,0, . . . On-.R,(@,) = G00, . . . Ans. 
But from this identity it does not seem easy to deduce the expression of the continued fraction (20). I therefore sought to determine directly functions $Q_n(x, z)$ by the system of difference equations

$$Q_{n-1} = Q_n + (1 - x^n)zQ_{n+1}, \qquad n = 1, 2, 3, \dots. \tag{23}$$

From this there will result, by a known theorem [3, p. 291],

$$\frac{Q_0}{Q_1} = 1 + \frac{(1 - x)z|}{|1} + \frac{(1 - x^2)z|}{|1} + \frac{(1 - x^3)z|}{|1} + \cdots,$$

that is,

$$\frac{Q_0}{Q_1} = 1 + (1 - x)zF[x, -(1 - x)z]. \tag{24}$$

I take arbitrarily $Q_0 = 1$ and I put

$$Q_n = f_0(n) + f_1(n)z + \dots + f_p(n)z^p + \cdots \tag{25}$$

with $f_0(n) = 1$.

By (23), the functions $f_p(n)$ must satisfy the equations

$$f_p(n) - f_p(n - 1) = (x^n - 1)f_{p-1}(n + 1). \tag{26}$$
Soit 


$* (x) = given (I a x")(1 — s) see (1 ee cage 
(1 — x)(1 — x’)... (1 — x”) 
avec #2(x) = 1; et soit encore 





m — 2q)(m — 1)(m — 2)...(m—¢q+1) 

E(m, q) = ak foe = Ee ED i” 

et E(m, q) = 0, pour m < 2¢ avec E(m,0) = 1. Les nombres E(m, qg) satisfont 
aux relations 


’ m > 2q 


E(m, 1) 1 + E(m — 1,1) 
E(m,q) = E(m — 1,q — 1) + E(m — 1, q) 
et d’autre part, on vérifiera que 
(27) O.-, — Bh, — B, = (x” — 1) O27h_,, p 


Cela étant, je dis que la fonction 


\W 


p+ 
(28) f,(n) = Z (—1)'"E(m + 2p, i — 1) 8255"? 
satisfait aux équations (26). Pour le voir, il suffira, dans (28), de remplacer 
E(n + 2p,i—1) par E(m —1+ 26,1 — 2) + E(m —1+ 2p,i —1) puis, 
aprés substitution dans le premier membre de (26), de grouper les termes qui 
ont le méme coefficient numérique E(k, g) et de se servir de la formule (27). La 
fonction (28) est donc une solution de l’équation aux différences (26) et c’est 
la seule qui convienne, car toute autre solution ne pourrait en différer que par 
une fonction ¥,(m), de période 1, par rapport a la variable n, et comme nous 
avons posé Q, = 1, il faut, d’aprés (25) que, sauf pour » = 0, nous ayons 
f,(0) = 0, d’od ¥,() - ¥,(0) = 0. 

Let us now set $n = 1$ in (28), and observe that the numbers $E(m, q)$ are none other than the Delannoy numbers, in a different order from that of the triangle of §1, $E(m, i) = D_{m-i-1}^{i}$, and we shall have

$$\begin{aligned} f_0(1) &= D_0^0, \\ f_1(1) &= D_2^0\,x - D_1^1, \\ &\;\;\vdots \\ f_p(1) &= D_{2p}^0\,x^{p(p+1)/2} - D_{2p-1}^1\,x^{p(p-1)/2} + D_{2p-2}^2\,x^{(p-1)(p-2)/2} - \dots + (-1)^p D_p^p. \end{aligned}$$

Let us substitute these values in formula (25), setting $n = 1$ there, arrange the series obtained according to powers of $x$, and have recourse to formula (6); we shall have

$$Q_1 = \varphi(-z) + xz\varphi^3(-z) + \dots + x^{p(p+1)/2}z^p\varphi^{2p+1}(-z) + \cdots \tag{29}$$

or, putting

$$A(q, u) = 1 + qu + q^3u^2 + \dots + q^{n(n+1)/2}u^n + \cdots, \tag{30}$$

$$1 + (1 - x)zF[x, -(1 - x)z] = \frac{1}{\varphi(-z)\,A[x, z\varphi^2(-z)]} \tag{31}$$

or again

$$1 - (1 - x)zF[x, (1 - x)z] = \frac{1 - z\varphi(z)}{A[x, 1 - \varphi(z)]}. \tag{32}$$

One sees that the function $A(q, u)$ is related to Jacobi's $\theta$ functions, but what distinguishes the continued fraction (20) from various continued fractions obtained formerly by Heine, Eisenstein [3, p. 315] and Ramanujan [4, p. 215] is that, in (30), an algebraic function has been substituted for the variable $u$.
To obtain the functions $S_n(x)$, one has, by (24) and (25),

$$1 - \sum_{n \ge 1} (-1)^n(1 - x)^n S_n(x)z^n = 1\Big/\sum_{p \ge 0} f_p(1)z^p;$$

one will therefore compute the polynomials $f_p(1)$, then the polynomials

$$g_n(x) = (-1)^n f_n(1)(1 - x)^{-n},$$

and one will have $S_1(x) = g_1(x)$ and, for $n \ge 2$,

$$S_n(x) = g_n(x) - g_{n-1}S_1 - g_{n-2}S_2 - \dots - g_1S_{n-1}.$$

Un autre procédé consiste 4 poser 

B(q,u) = {A(q,u)}"* = 14+ B.(qg)ut+...+8.(g)u"+... 
et il résulte alors de la formule (31), aprés quelques calculs, que 
(33) (1 —x)"S,(@) = Di=i(1 — x) — Dr*B, (x) + Dii8,(x) 

— Dis3B.(%) +... + (— 1)”"D;-.8, (x). 


The polynomials $\beta_n(x)$ are easy to compute. As a check, one must have, for $n \ge 2$, $\beta_n(1) = 0$, $\beta_n(-1) = 2$. If they are regarded as known, formula (33), after multiplication by $(1 - x)^{-n}$, will give $S_n(x)$ explicitly. Tables for $g_n(x)$, $\beta_n(x)$, $S_n(x)$, and $P_n(x)$ will be found further on.


8. Knowing $S_n(x)$, we now have to determine the polynomials $T_n(x)$, which represent the total configurations of $n$ arcs, and the polynomials $P_n(x)$, which represent the configurations forming a proper system. For brevity, let $g(y) = g_0 + g_1y + \dots + g_ny^n + \cdots$ be any Taylor series; we shall denote by $K(y^n, g)$ the coefficient of $y^n$ in the expansion of $g$, that is, $g_n$, and we shall also use an abbreviated language, identifying the configurations with the functions which represent them. This being granted, $T_n(x)$ comprises:

1°, the configurations forming only a single system, that is, $S_n(x)$; this is the coefficient $K(z^n, zF)$ of $z^n$ in $zF(x, z)$.

2°, the configurations forming two systems; this is $\sum S_pS_q$, $p + q = n$; it is therefore $K(z^n, z^2F^2)$.

3°, the configurations forming three systems; this is $\sum S_pS_qS_r$, $p + q + r = n$; it is therefore $K(z^n, z^3F^3)$; and so on. Hence

$$T_n(x) = K(z^n, zF + z^2F^2 + \dots + z^nF^n)$$

and the series can be continued indefinitely, since $z^{n+1}F^{n+1}, z^{n+2}F^{n+2}, \dots$ contain no term in $z^n$. The sum of this series is

$$\frac{zF}{1 - zF} = -1 + \frac{1}{1 - zF};$$

hence, referring to formula (20), $T_n(x)$ is the coefficient of $z^n$ in the expansion of the continued fraction

$$-1 + \frac{1|}{|1} - \frac{a_1z|}{|1} - \frac{a_2z|}{|1} - \cdots.$$

One sees that if, in the function $R_{n+1}(a)$ of §2, one replaces $a_1$ by 1, $a_2$ by $a_1$, $a_i$ by $a_{i-1}$, one obtains $T_n(x)$, hence

$$T_n(x) = R_{n+1}(1, a_1, a_2, \dots, a_n), \qquad n \ge 1,$$

and one would have the general expression of $T_n(x)$ by modifying formula (15) in the same way.
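The substitution $T_n(x) = R_{n+1}(1, a_1, \dots, a_n)$ can be tried out alongside the decomposition into one, two, three, ... systems. The Python sketch below is an illustration with hypothetical helper names; `R(n, letter)` evaluates $R_n$ by the first rule of §2 with an arbitrary substitution for the letters, and `T_by_systems` uses the equivalent recurrence $T_n = S_n + \sum_{k=1}^{n-1} S_kT_{n-k}$, which follows from summing $zF + z^2F^2 + \cdots$.

```python
def poly_mul(p, q):
    out = [0] * (len(p) + len(q) - 1)
    for i, pi in enumerate(p):
        for j, qj in enumerate(q):
            out[i + j] += pi * qj
    return out

def poly_add(p, q):
    n = max(len(p), len(q))
    return [(p[i] if i < len(p) else 0) + (q[i] if i < len(q) else 0) for i in range(n)]

def a(i):
    """a_i = 1 + x + ... + x^(i-1) as a coefficient list."""
    return [1] * i

def R(n, letter):
    """R_n with `letter(i)` substituted for a_i, monomials being grouped by their last index."""
    last = {1: letter(1)}
    for _ in range(n - 1):
        new = {}
        for i, poly in last.items():
            for j in range(2, i + 2):
                new[j] = poly_add(new.get(j, [0]), poly_mul(poly, letter(j)))
        last = new
    total = [0]
    for poly in last.values():
        total = poly_add(total, poly)
    return total

def S(n):
    return R(n, a)

def T(n):
    """T_n(x) = R_{n+1}(1, a_1, ..., a_n): a_1 -> 1 and a_i -> a_{i-1}."""
    return R(n + 1, lambda i: a(i - 1) if i > 1 else [1])

def T_by_systems(n):
    """T_n from the decomposition into successive systems."""
    total = S(n)
    for k in range(1, n):
        total = poly_add(total, poly_mul(S(k), T_by_systems(n - k)))
    return total

print(T(3))              # -> [5, 6, 3, 1]
print(T_by_systems(3))   # -> [5, 6, 3, 1]
```

Both routes give $T_3(x) = 5 + 6x + 3x^2 + x^3$, whose coefficients sum to $1\cdot 3\cdot 5 = 15$.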


9. To obtain $P_n(x)$, we shall remark that every system of $n$ arcs is formed by a proper system, one or more of whose arcs cover arbitrary configurations, that is, total configurations; these take their place in the intervals between the feet of the arcs of the proper system and, in the case of a proper system of $\mu$ arcs, there will be $2\mu - 1$ intervals between the feet of the arcs. Let us put at once

$$\chi(x, z) = \sum T_n(x)z^n, \qquad \omega(x, z) = \sum P_n(x)z^n. \tag{34}$$

We have, by §8,

$$\chi(x, z) = \frac{zF(x, z)}{1 - zF(x, z)} \tag{35}$$

and, by (17),

$$zF(x, z) = \sum S_n(x)z^n. \tag{36}$$

We shall take $S_0(x) = T_0(x) = P_0(x) = 0$.

The function $S_n(x)$ comprises:

1°, the proper systems of $n$ arcs, $P_n(x)$;

2°, the proper systems of $n - 1$ arcs, $P_{n-1}(x)$. There are $\binom{2n-3}{1}$ intervals in which a figure $T_1(x)$ can be placed in turn, whence the term $P_{n-1}(x)\binom{2n-3}{1}T_1(x)$, which we shall write $P_{n-1}\binom{2n-3}{1}K(z, \chi)$;

3°, the proper systems of $n - 2$ arcs, $P_{n-2}(x)$. There are $2n - 5$ intervals; one can place a configuration $T_2(x)$ in a single interval, whence the term $\binom{2n-5}{1}P_{n-2}T_2$; or else one can place two configurations $T_1(x)$ in a combination of two intervals, whence the term $\binom{2n-5}{2}P_{n-2}T_1^2$. One sees that it is the partitions of the number 2, namely $2 = 2$ and $2 = 1 + 1$, which appear. The two terms together are written

$$P_{n-2}\left[\binom{2n-5}{1}K(z^2, \chi) + \binom{2n-5}{2}K(z^2, \chi^2)\right];$$

4°, the proper systems of $n - 3$ arcs, $P_{n-3}(x)$. There are $2n - 7$ intervals, with 6 places in all available for 3 arcs. If the 3 arcs are in a single interval, which corresponds to the partition $3 = 3$, one will have the term $P_{n-3}\binom{2n-7}{1}T_3$. If the 3 arcs are in two intervals, which corresponds to the two partitions $3 = 1 + 2$ and $3 = 2 + 1$, one will have the term $P_{n-3}\binom{2n-7}{2}(T_1T_2 + T_2T_1)$. If the three arcs are in 3 intervals, which corresponds to the partition $3 = 1 + 1 + 1$, one will have the term $P_{n-3}\binom{2n-7}{3}T_1^3$. The three terms together are written

$$P_{n-3}\left[\binom{2n-7}{1}K(z^3, \chi) + \binom{2n-7}{2}K(z^3, \chi^2) + \binom{2n-7}{3}K(z^3, \chi^3)\right];$$

5°, the proper systems formed of $n - 4$ arcs, $P_{n-4}(x)$; the partition $4 = 4$ gives the term $P_{n-4}\binom{2n-9}{1}T_4$; the partitions $4 = 1 + 3 = 2 + 2 = 3 + 1$ give the term $P_{n-4}\binom{2n-9}{2}(T_1T_3 + T_2^2 + T_3T_1)$; the partitions $4 = 1 + 1 + 2 = 1 + 2 + 1 = 2 + 1 + 1$ give the term $P_{n-4}\binom{2n-9}{3}\,3T_1^2T_2$; the partition $4 = 1 + 1 + 1 + 1$ gives the term $P_{n-4}\binom{2n-9}{4}T_1^4$. The 4 terms together are written

$$P_{n-4}\left[\binom{2n-9}{1}K(z^4, \chi) + \binom{2n-9}{2}K(z^4, \chi^2) + \binom{2n-9}{3}K(z^4, \chi^3) + \binom{2n-9}{4}K(z^4, \chi^4)\right]$$

and so on. One therefore has

$$S_n = P_n + P_{n-1}\left[\binom{2n-3}{1}K(z, \chi)\right] + P_{n-2}\left[\binom{2n-5}{1}K(z^2, \chi) + \binom{2n-5}{2}K(z^2, \chi^2)\right] + \cdots.$$

In each bracket, the series can be continued, for it is clear, for example, that $K(z^2, \chi^3), K(z^2, \chi^4), \dots$ are zero. Moreover, one can introduce in each bracket a term of the form $K(z^p, \chi^0) = K(z^p, 1)$, which is zero provided that $p \ge 1$. One therefore has

$$S_n = P_n + P_{n-1}K[z, (1 + \chi)^{2n-3}] + \dots + P_{n-p}K[z^p, (1 + \chi)^{2n-2p-1}] + \dots + P_1K[z^{n-1}, 1 + \chi].$$

The last term is still correct for $n = 1$, for it becomes $P_1(x)K(1, 1 + \chi) = P_1(x)$, which is correct, since $S_1(x) = P_1(x)$. The first term $P_n$ can be written $P_nK[1, (1 + \chi)^{2n-1}]$, for $K[1, (1 + \chi)^{2n-1}] = 1$, and this expression is valid for $n = 1$. Thus

$$S_n = P_nK[1, (1 + \chi)^{2n-1}] + \dots + P_1K[z^{n-1}, 1 + \chi]$$

and this formula is valid even for $n = 0$, since $P_0 = T_0 = S_0 = 0$.
Maintenant, il est clair que, quelle que soit g(z), 


K[2’, g(2)] = K[2’"*, 2*g(z)]; 
on peut donc écrire 
S, = P,K[2", 2°(1 + x)" ) + P,-.Kl2", 2° "(1 + x") 
+...+ P,K[2’, 2(1 + x)] 
et ceci exprime que S,(x) est le coefficient de 2” dans le développement de 
P,2(1 + x) + Ps*(1 + x)? +... + Pas*(1 + x)”. 


On peut prolonger cette série indéfiniment, car, au-dela de P,z"(1 + x)”"”’, les 
termes qui viendront ne contiennent plus 2”. Nous avons donc, d’aprés (34), 
1 


S,(x)2" = To 21 + x)’), 


-~Me 


et, d’aprés (35) et (36), 
















$$\omega\Bigl\{x,\ \frac{z}{[1 - zF(x,z)]^{2}}\Bigr\} = \frac{zF(x,z)}{1 - zF(x,z)}. \tag{37}$$

This is the equation relating the generating function $\omega(x, z)$ of the proper systems to the generating function $zF(x, z)$ of the proper or improper systems.

Define a variable $u$ by the equation

$$z - u[1 - zF(x,z)]^{2} = 0; \tag{38}$$

this equation has a root $\zeta(x,u)$, which can be expanded by Lagrange's series. From the values of $S_n(x)$ given in §15, we have

$$\zeta = u - 2u^{2} - (2x - 3)u^{3} - (2x^{3} + 6x^{2} - 6x + 4)u^{4} - \dots$$

and it follows from (37) that

$$\omega(x,u) = \frac{\zeta F(x,\zeta)}{1 - \zeta F(x,\zeta)}. \tag{39}$$

This formula can be checked by setting $x = 0$. The only proper system with zero double points is represented by $P_1(x) = 1$, whence $P_1(0) = 1$, and we must have $\omega(0,u) = u$. Now $F(0,z) = \varphi(z)$ and equation (38) gives simply $\zeta = u/(1+u)^{2}$. Then $\zeta F(0,\zeta) = u/(1+u)$ and, by (39), $\omega(0,u) = u$.

Equations (38) and (39) can be put in a remarkable form. Set

$$\psi(x,u) = -uF(x,u) = -\sum_{n=1}^{\infty} S_n(x)u^{n}. \tag{40}$$

Recall also that

$$\omega(x,u) = \sum_{n=1}^{\infty} P_n(x)u^{n}. \tag{41}$$

Then

$$\omega(x,u) = -\frac{\psi(x,\zeta)}{1 + \psi(x,\zeta)}, \tag{42}$$

where $\zeta$ is the root, vanishing with $u$, of the equation

$$\zeta - u[1 + \psi(x,\zeta)]^{2} = 0,$$

and conversely

$$\psi(x,u) = -\frac{\omega(x,\zeta)}{1 + \omega(x,\zeta)}, \tag{43}$$

where $\zeta$ is the root, vanishing with $u$, of the equation

$$\zeta - u[1 + \omega(x,\zeta)]^{2} = 0.$$

The function $\psi(x,u)$ is derived from $\omega(x,u)$ by a certain operation, defined by equations (43), and conversely, by (42), if the same operation is applied to the function $\psi(x,u)$, one recovers $\omega(x,u)$. The functions $\psi$ and $\omega$ are reciprocal with respect to this operation.

Here is what follows for the polynomials $P_n(x)$ and $S_n(x)$. Consider a function vanishing with $z$,

$$C(z) = C_1 z + C_2 z^{2} + \dots + C_n z^{n} + \dots;$$

the root, vanishing with $u$, of the equation

$$\zeta - u[1 + C(\zeta)]^{2} = 0$$

is, carrying the calculation as far as the term of degree 5,

$$\zeta = u + 2C_1u^{2} + (5C_1^{2} + 2C_2)u^{3} + (14C_1^{3} + 14C_1C_2 + 2C_3)u^{4} + (42C_1^{4} + 72C_1^{2}C_2 + 18C_1C_3 + 9C_2^{2} + 2C_4)u^{5} + \dots$$

and one then finds

$$\frac{C(\zeta)}{1 + C(\zeta)} = G_1(C_1)u + G_2(C_1, C_2)u^{2} + \dots + G_n(C_1, C_2, \dots, C_n)u^{n} + \dots,$$

$G_n(C_1, C_2, \dots, C_n)$ being a polynomial in $C_1, C_2, \dots, C_n$:

$G_1 = C_1$

$G_2 = C_1^{2} + C_2$

$G_3 = 2C_1^{3} + 4C_1C_2 + C_3$

$G_4 = 5C_1^{4} + 15C_1^{2}C_2 + 6C_1C_3 + 3C_2^{2} + C_4$

$G_5 = 14C_1^{5} + 56C_1^{3}C_2 + 28C_1^{2}C_3 + 28C_1C_2^{2} + 8C_1C_4 + 8C_2C_3 + C_5.$

Setting first $C_n = -S_n(x)$, then $C_n = P_n(x)$, and referring to formulas (40) and (41), we obtain the reciprocal formulas

$$P_n(x) = -G_n(-S_1, -S_2, \dots, -S_n), \qquad S_n(x) = G_n(P_1, P_2, \dots, P_n),$$

which I have verified up to $n = 4$. The reciprocity would be still more apparent if one set $-S_n(x) = S'_n(x)$.
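The reciprocity between the two families can be checked numerically at $x = 1$. The following is a minimal sketch, not the author's procedure: it assumes the operation is "solve $\zeta = u[1 + C(\zeta)]^2$ for the root vanishing with $u$, then expand $C(\zeta)/(1 + C(\zeta))$", and it works with truncated power series stored as coefficient lists. The helper names (`mul`, `compose`, `touchard_transform`) are hypothetical; the numerical values $S_n(1)$ and $P_n(1)$ are those quoted in the text.

```python
def mul(a, b, n):
    """Product of two power series (coefficient lists, lowest degree first),
    truncated to degrees below n."""
    out = [0] * n
    for i, ai in enumerate(a[:n]):
        if ai:
            for j, bj in enumerate(b[: n - i]):
                out[i + j] += ai * bj
    return out


def compose(c, zeta, n):
    """c(zeta) by Horner's rule; zeta must have zero constant term."""
    out = [0] * n
    for coeff in reversed(c[:n]):
        out = mul(out, zeta, n)
        out[0] += coeff
    return out


def touchard_transform(coeffs):
    """Given C_1..C_m, return G_1..G_m with C(zeta)/(1 + C(zeta)) = sum G_n u^n,
    where zeta = u * (1 + C(zeta))**2 and zeta vanishes with u."""
    m = len(coeffs)
    n = m + 1                          # keep degrees 0..m
    c = [0] + list(coeffs)
    zeta = [0] * n
    for _ in range(n):                 # each pass fixes one more coefficient
        one_plus_c = compose(c, zeta, n)
        one_plus_c[0] += 1
        square = mul(one_plus_c, one_plus_c, n)
        zeta = [0] + square[: n - 1]   # multiply by u
    cz = compose(c, zeta, n)
    # C/(1 + C) = C - C^2 + C^3 - ...  (C has no constant term)
    result = [0] * n
    power, sign = cz[:], 1
    for _ in range(m):
        result = [r + sign * p for r, p in zip(result, power)]
        power = mul(power, cz, n)
        sign = -sign
    return result[1:]


S = [1, 2, 10, 74, 706]    # S_n(1), n = 1..5, as quoted in the text
P = [1, 1, 4, 27, 248]     # P_n(1), n = 1..5, as quoted in the text

P_from_S = [-g for g in touchard_transform([-s for s in S])]
S_from_P = touchard_transform(P)
print(P_from_S)
print(S_from_P)
```

Feeding in $C_1 = 1$ and all other $C_n = 0$ isolates the leading coefficients of the $G_n$, which gives a quick cross-check of the polynomials listed above.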




10. The number of configurations of $n$ arcs forming a single system is $S_n(1)$. With the aid of (22), one obtains the recurrence

$$S_n(1) = p_{2n} - p_{2n-2}S_1(1) - p_{2n-4}S_2(1) - p_{2n-6}S_3(1) - \dots - S_{n-1}(1),$$

where $p_{2n} = 1\cdot 3\cdot 5 \cdots (2n - 1)$. The number of configurations forming a proper system is $P_n(1)$. We therefore need $\omega(1, u)$. Instead of formula (39), we use the formula

$$\omega(x,u) = uF(x,\zeta) - u\zeta F^{2}(x,\zeta),$$

which, by (38), is equivalent to it. We have

$$F(1, z) = 1 + 2z + 10z^{2} + 74z^{3} + 706z^{4} + 8162z^{5} + 109960z^{6} + \dots;$$

the root $\zeta$ of equation (38), for $x = 1$, is

$$\zeta = u - 2u^{2} + u^{3} - 6u^{4} - 34u^{5} - 356u^{6} + \dots,$$

and we then have

$$\omega(1,u) = u + u^{2} + 4u^{3} + 27u^{4} + 248u^{5} + 2830u^{6} + 37782u^{7} + \dots.$$
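The recurrence for $S_n(1)$ stated at the start of this section is easy to run. The sketch below assumes the reading $S_n(1) = p_{2n} - \sum_{k=1}^{n-1} p_{2(n-k)} S_k(1)$ with $p_{2n} = 1\cdot3\cdots(2n-1)$; the function names are hypothetical.

```python
def odd_double_factorial(n):
    """p_{2n} = 1 * 3 * 5 * ... * (2n - 1)."""
    result = 1
    for k in range(1, n + 1):
        result *= 2 * k - 1
    return result


def single_system_counts(n_max):
    """S_1(1), ..., S_{n_max}(1) from
    S_n = p_{2n} - sum_{k=1}^{n-1} p_{2(n-k)} * S_k."""
    counts = []
    for n in range(1, n_max + 1):
        total = odd_double_factorial(n)
        for k in range(1, n):
            total -= odd_double_factorial(n - k) * counts[k - 1]
        counts.append(total)
    return counts


first_six = single_system_counts(6)
print(first_six)
```

The first six values agree with the coefficients of $F(1, z)$ quoted above. Carried one step further, the recurrence as read here gives 110410 for $n = 7$, whereas the printed series and table show 109960; the source of that difference is not resolved here.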
The value of $P_n(-1)$ is much easier to compute, since $F(-1, z) = 1$ and equation (38) reduces to $\zeta - u(1 - \zeta)^{2} = 0$; the root that vanishes with $u$ is

$$\zeta = \frac{1 + 2u - (1 + 4u)^{1/2}}{2u};$$

we then have, by (39),

$$\omega(-1,u) = u\varphi(-u) = D_0u - D_1u^{2} + D_2u^{3} - \dots,$$

hence $P_n(-1) = (-1)^{n-1}D_{n-1}$. Let $N_p(n)$ and $N_i(n)$ be the numbers of proper systems of $n$ arcs having respectively an even and an odd number of double points; then

$$N_p(n) - N_i(n) = (-1)^{n-1}D_{n-1},$$

where the numbers $D$ count the configurations of arcs without double points.
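The alternating-sum identity can be checked against the polynomials $P_1, \dots, P_5$ tabulated at the end of the paper. This is a sketch under two assumptions: the exponents of the tabulated polynomials are restored from the degree range ($n - 1$ to $n(n-1)/2$ double points), and the numbers $D_m$ are taken to be the Catalan numbers $1, 1, 2, 5, 14, \dots$, which is what expanding $u\varphi(-u) = \tfrac12[(1+4u)^{1/2} - 1]$ gives.

```python
from math import comb

# P_n(x) stored as {exponent: coefficient}; exponents restored as described above.
PROPER = {
    1: {0: 1},
    2: {1: 1},
    3: {3: 1, 2: 3},
    4: {6: 1, 5: 4, 4: 10, 3: 12},
    5: {10: 1, 9: 5, 8: 15, 7: 35, 6: 60, 5: 77, 4: 55},
}


def catalan(m):
    """m-th Catalan number, assumed here to equal D_m."""
    return comb(2 * m, m) // (m + 1)


def even_minus_odd(poly):
    """Number of systems with an even number of double points minus the
    number with an odd number, i.e. the value of the polynomial at x = -1."""
    return sum(c if e % 2 == 0 else -c for e, c in poly.items())


differences = [even_minus_odd(PROPER[n]) for n in sorted(PROPER)]
expected = [(-1) ** (n - 1) * catalan(n - 1) for n in sorted(PROPER)]
print(differences, expected)
```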


11. Some formulas that appear to be of interest are obtained as follows. Let us rewrite equation (31) or (32) in the form
1 1 
(44) a Ee 
¢—->) A[x,1— o(7—,)) 


We give $\zeta$ a value such that the right-hand side of (44) takes a simple form. A first identity is obtained by expanding the left-hand side in series by formula (36). We then substitute this value of $\zeta$ into equation (38); from this the value of $u$ is deduced, and then the expression for $\omega(x, u)$, by formula (39) or by the simpler formula

$$\omega(x,u) = \Bigl(\frac{u}{\zeta}\Bigr)^{1/2} - 1, \tag{45}$$

which is equivalent to it and in which the determination of the radical is the one that reduces to $+1$ for $u = 0$, since $\omega(x, 0) = 0$. Hence a second identity, obtained by expanding the left-hand side of (45) by formula (34). Care must be taken to use, for the algebraic function $\varphi(z)$, the determination (3), holomorphic at the origin, which at the end of this section I shall call $\varphi_1(z)$, and not the conjugate determination $\varphi_2(z)$, infinite at the origin, the second root of equation (4). Consequently, in equation (44), one cannot give an arbitrary value to


o(). 


1-—*x 


Soit ¢ — d’ot o( 4) = 2. D’aprés (44), 


1—<x 1—*x 1 
(46) jee: ing («, i=) = Drew)’ 
ou 




A(x) = 1—x4+x2° —x°4+ x — x" 4+.... 
D’aprés (38), « = (1 — x)A’*(x) et, d’dprés (45), 
w(x, u) = 2A(x) — 1, d’od les identités 





(48) > P,(x)(1 — x)"A"" (x) = 2A(x) — 1. 
Soit ¢=-—- = , o( )}=1-—~x et soit 


1—*x rs 


w(x) =14+2°4+2°4+2°4+2%*4+2%"4+ 


on aura de méme 


> (- en mae © 
(49) 2 (-19S.@)q— b= sition tie 


(50) - 2 (— 1)"P,(x)x"(1 — x)"u'"(x) = (1 — x)u(x) — 1. 


Dans (38), (44) et (45), changeons x en x’ et on verra d’une maniére analogue 
qu’en posant 


f. _— x*/(1 = x"), i.=- x(1 + x)/(1 = x), i, = x(1 — x) ‘(1 + x) 








ona 

51 2x*/* 2. - 1 0 

” ei ae tones) ~ 

2x 1 

52 1 + ———- ———_—_ = 9,(0, 

ae 1— x 1 — ¢,F(x’, ¢,) “ 
: 2x 1 

53 1-——— = 6,(0), 

(53) i+=1—¢.F@.t) (0) 

ou 


0,(0) = 2x*/* + 2n* 4+ 29° 4 
0,(0) = 1 + 2x + 2x* + 2x°+..., 
6.(0) = 1 — 2x + 2x* — 2x° +... 


‘9 


et l’on aura des formules correspondantes pour w(x’, u). Des formules ci-dessus 
on tire des fractions continues plus ou moins simples, par exemple 








—3 1 — x'| 1-x 1 — x* 
ote ook wie i = al - | 


Suppose for simplicity that $x$ is real and positive, $0 < x < 1$. The series $F(x, z)$ converges absolutely on the whole of its circle of convergence $|z| = \tfrac14(1 - x)$. The series $\omega(x, u)$ certainly converges for $4|u| < 1 - x$, since the proper systems form only a part of the proper and improper systems and $P_n(x) < S_n(x)$. I am not at present able to say what the radius of convergence of $\omega(x, u)$ is. Except for $x = 0$, this radius of convergence is finite, for, with the aid of the formulas I gave in my article [6] for the number of proper systems of $p$ arcs having $p - 1$ or $p$ double points, one can show that the series $\omega(x, u)$ diverges for $|u| > 4/(27x)$. It follows that the series (47) converges for $0 < x < 1$; the series (49) for $0 < x < 3 - 2\sqrt2 = 0.1716$; the series $F(x^{2}, \zeta_1)$ for $0 < x < \sqrt2 - 1 = 0.4142$; the series $F(x^{2}, \zeta_2)$ for $0 < x < 3 - 2\sqrt2$; the series $F(x^{2}, \zeta_3)$ for $0 < x < 1$. The series (50) and the series $\omega(x^{2}, u)$ corresponding to the values $\zeta_1, \zeta_2, \zeta_3$ converge for sufficiently small values of $x$, since the corresponding expressions for $u$ contain $x$ or $x^{2}$ as a factor. But the convergence of the series (48) remains doubtful at present, since $A(x)$ is greater than $\tfrac12$ and the condition $4(1 - x)A^{2}(x) < 1 - x$ is not satisfied.

Consider now, still supposing $0 < x < 1$, the function $F(x, z)$ taken in full generality. By (44), it is a function with two branches: one, $\Phi_1(x, z)$, given by the determination $\varphi_1(z)$ of $\varphi(z)$, is the one that we have so far always called $F(x, z)$; the other, $\Phi_2(x, z)$, is given by the determination $\varphi_2(z)$ of $\varphi(z)$. One can define $\varphi_1(z)$ and $\varphi_2(z)$ without ambiguity in the whole $z$-plane, and hence also $\Phi_1(x, z)$ and $\Phi_2(x, z)$, by the equations
mate oat 
1 — 2@,(x,z) 6.(; —*x al =. 1 
1 Zz 
i — 26,(@, 2) ~ (42) Al si ol ) 


¢,(z)¢,(z) = 1/z, 1 — ¢,(z) = 1/[1 — ¢,(z)] 


et, d’autre part, 








Or, 


oo 


(54) A(x,v) += L Als, y= yt 9" 4 9") = p(x, 2), 


n=0 


ot 


p(x,v) = (1+ tl (1 — x")(1 + x"v)(1 + xv" 


n=1 


- = 
multiplions par ¢, = et nous obtiendrons, aprés un calcul simple, 
1-—<x 


1 1 Zz Zz 
(55) 1 —2%,(x,z) 1 — 24,(x, z) ws o(,2-) ° E es o{;2--)} 


C’est la formule que nous voulions établir. Soit alors 1 — .(; =) = — x” 


est une fonction @ proprement dite. Dans (54), faisonsv = 1 — +(7—) et 





on en tire 




x’(1 — x) 
= ra coe <r ’ 
(1 + x”) 
et cette expression ne change pas quand on change pen — p. Le second membre 
de (55) s’annule pour toutes ces valeurs de z et s’annule aussi pourz = ©. La 
fonction générale F(x,z) admet donc non seulement les points doubles 


z = }(1 — x) et s = =~, qui sont les points de ramification de +(7—): mais 


(56) : p = 0,1, 2,3,..., 


un nombre quelconque fini de points doubles donnés par (56), pour p = 1, 2, 3, 
..et qui ont z = 0 comme point limite, point essentiel de ®, (x, z). 


12. The reader is now asked to refer to my article [6]. Symbols and formulas belonging to that article will be followed by the indication [6]. I propose to show that equation [6, (13)] is none other than the equation

$$\omega\{x,\ z[1 + \chi(x,z)]^{2}\} = \chi(x,z), \tag{57}$$

which is deduced from (37) by eliminating $F(x, z)$ with the aid of (35).

D’abord U,,,(p) est le nombre des configurations de m arcs qui ont p points 
doubles [6]; c’est donc le coefficient de u” dans T.,(u), c’est-A-dire le résidu A 
l’origine, au sens de Cauchy, de T,,(u)/u’**. Il y a exception pour” = 0, p = 0, 
car nous avons posé U,(0) = 1, tandis que 7,(u) = 0. Donc, d’aprés [6, (5)], 
f,(x) est, pour p = 0, 1, 2, 3,..., le résidu A l'origine de 


[1 + 7,(u)x* + T,(u)x* +...]/u”: 


1 1+ x(u, x’) 
S,(x) = si ose. 


du, 


pri 


u 
y étant un petit cercle qui entoure |’origine. 


D’aprés [6, (9)], et en supposant |z’| < |u|, pour étre autorisé A sommer, 
nous avons ensuite 





1 u, x’) 
+ x( = du. 
rm a 


Mais la fonction x(u, x*) est holomorphe au voisinage de |’origine et, d’autre 
. i ol i ° . ° 

part, puisque |z"| < |u|, le cercle y contient le point z*. On a donc simplement, 

d’aprés l’intégrale de Cauchy, 


(58) y(x, 2) = x2[1 4- x(s*, x*)]. 


De méme, ¢,,(~ + m — 1) est le nombre des systémes propres de m arcs 
ayant p+ mn — 1 points doubles [6]; c’est donc le coefficient de u”**"* dans 


P,(u) et, d’aprés [6, (7)], g,(y) est le résidu A l’origine de o(u,%) /u?. 
u 
Ensuite, d’aprés [6, (8)], et en supposant |z*| < |u|, 


3 7) du 
(59) G(y, 2) = | ws(w, ; 


Y 













Mais il est facile de voir que la fonction uw(u, y*/u) est holomorphe 4 I'’origine. 
En effet, 


P,, (u) = o4,(0) + o.,(1)u +... +6..(p)u?+..., 


et il n’existe pas de systéme propre de # arcs, sip < m — 1. Donc le polynéme 
P,(u) est divisible par u”~* et 


y . P 
uw ott = P.(u)y’+...+— 


est réguliére pour u = 0. La formule (59) se réduit donc a 
(60) G(y, 2) = 2°w(z", y*/z*). 
En portant les expressions (58) et (60) dans l’équation [6, (13)], que je récris 
zy = xz’ + xG(y,2 
et en divisant par xz , on obtient 
x(z", x”) = w{2", x*[1 + x(2’, x*)]*}, 


et il suffit de remplacer z par x! et x par 2 pour obtenir l’équation (57). 

Resterait 4 obtenir les équations algébriques auxquelles satisfont les fonctions 
g,(y) ce que nous n’avons pas fait [6]. J’ai remarqué a ce sujet que, d’aprés 
[6, (14)] et [6, (15)], les équations satisfaites par g,(y) et g,(y) sont les équations 
de deux cubiques unicursales. On vérifiera en effet que, ¢ désignant un para- 
métre, 


y=t-?, 
g(y) =’ —t, 
t*(#? +1) =f 
.y) = -—— 
&.(y) 32? — 1 


13. Referring to §5, we see that the continued fraction appearing on the right-hand side of (20) results from relation (16), and this in turn is a consequence of the two rules of §2, which enabled us to form the polynomials $R_n(a)$. This result evidently holds whatever meaning is given to the letters $a_1, a_2, a_3, \dots$. We have thus obtained, in entirely explicit form and without recourse to the formation of its convergents, the series expansion
ae + a2 + asl +...= 2 (—1)"R,(a)2” 


n==1 





of an arbitrary continued fraction of the type indicated. The polynomials $R_n(a)$ are given by formula (15). If one had to expand the continued fraction


G(z) = 6. + 


a,z| 
| b, 


a.z| 


le, * 





hs be 
on l’écrirait 


G(z) ; 
"4 











az/bb, 


“T3 


o/b + Gores ac 




et l’on aurait 


G(z)/b, = 1+ 2 (—1)""R, (a, b)2", 
ot R, (a, 6) désigne la fonction R,(a) dans laquelle on a remplacé a, par a,/b,_,), 
(¢ = 1, 2, 3,...) et l’on modifierait en conséquence la formule (15). 


14. Soit 


| | 
Mob, + +e +... 


une fraction continue quelconque et soit R, = A,/B, la kéme réduite ou fraction 
convergente, R, = b,, R, = (b,.b, + a,)/b,,.... Le symbole R, n’a plus ici 
aucun rapport avec la fonction (15) des sections précédentes. En se servant 
de la formule élémentaire qui donne |'expression de M au moyen d'un quotient 
complet, en y remplacant a, par a, + x, ) par b, + y et en développant en série, 
on trouve 


(— 1)" (a,a,...a,)"*" °**M 
_ a oo Ss: Se... M 
(m+n—'*)! day db, 


= (B,. tia Cae —_ Seay 
. [(m + n)B,_,B,_,.M — mA,_,B,_, — nA,_,B,-,]. 
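For readers who want to experiment with the convergents $R_k = A_k/B_k$ introduced at the start of this section, here is a minimal sketch. It assumes the standard three-term recurrences $A_k = b_kA_{k-1} + a_kA_{k-2}$ and $B_k = b_kB_{k-1} + a_kB_{k-2}$, which the text does not write out, and the function name `convergents` is hypothetical.

```python
from fractions import Fraction


def convergents(b, a):
    """Convergents R_k = A_k / B_k of b[0] + a[1]/(b[1] + a[2]/(b[2] + ...)).
    a[0] is a placeholder and is never used."""
    A_prev, B_prev = 1, 0          # A_{-1}, B_{-1}
    A, B = b[0], 1                 # A_0, B_0
    out = [Fraction(A, B)]
    for k in range(1, len(b)):
        A, A_prev = b[k] * A + a[k] * A_prev, A
        B, B_prev = b[k] * B + a[k] * B_prev, B
        out.append(Fraction(A, B))
    return out


b = [2, 3, 5]
a = [0, 7, 11]
R = convergents(b, a)
print(R)
```

With these sample values, $R_0 = b_0$ and $R_1 = (b_0b_1 + a_1)/b_1$, as stated in the text, and the last convergent equals the finite fraction evaluated directly.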





En particulier 











a”’M 
4 (Ri-, sas R,-,)"a;—— = (M — R,-,)’(M — Mists p > s 
p: 0a’, 
1 \p »o M P —P \p+s 
p! (Ri, = R,-,) * opp = (B,-,) (B,-.) (M = Ro-,) ’ p 2 1. 
D’autre part, on a aussi 
M/b, =1+ 2, /bsb.| 4 Selbibel 4 Safed 
ie: % | 1 | 1 
et, en considérant (b,5,), (b,5,), (6,b,), . .. comme des variables indépendantes, 
on trouvera par un procédé analogue 
(— 1)” 7 >» OM 
p! (R,-, R,-;) (b,_,d,) 8(d,_.d,)” 


= (B,_,)’(M — R,-,)(M — R,-.)’. 
En comparant ces diverses formules, on arrive A des équations aux dérivées 
partielles qui peuvent, a vrai dire, s’obtenir directement et dont nous ne citerons 


que celles-ci, 
a( my 40M eM _y 








Oa, Ob,-, db, 
, 2M aM = eh ae F £u | 
* da, 00,0, od L da, : dap J 













Inversement, certaines relations différentielles, faciles A établir, peuvent con- 
duire, a l'aide des formules ci-dessus, A des identités dont nous citerons la 
suivante 


(B,M > A,)(B,_,M — A,.,) - a,_,0,(B,_.M a A,-,)(B,-.M — A,-,) 
= ab,-,(B,-.M — Ax-_,)* + b,(Bi-,M — Ay-,)’, 


vérifiée quel que soit M et qui, fort probablement, est déja connue. 


$S_n(x)$

$S_1 = 1$

$S_2 = 1 + x$

$S_3 = 2 + 4x + 3x^{2} + x^{3}$

$S_4 = 5 + 15x + 21x^{2} + 18x^{3} + 10x^{4} + 4x^{5} + x^{6}$

$S_5 = 14 + 56x + 112x^{2} + 148x^{3} + 143x^{4} + 109x^{5} + 68x^{6} + 35x^{7} + 15x^{8} + 5x^{9} + x^{10}$

$S_6 = 42 + 210x + 540x^{2} + 945x^{3} + 1255x^{4} + 1353x^{5} + 1236x^{6} + 984x^{7} + 696x^{8} + 441x^{9} + 250x^{10} + 126x^{11} + 56x^{12} + 21x^{13} + 6x^{14} + x^{15}$


$P_n(x)$

$P_1 = 1$

$P_2 = x$

$P_3 = x^{3} + 3x^{2}$

$P_4 = x^{6} + 4x^{5} + 10x^{4} + 12x^{3}$

$P_5 = x^{10} + 5x^{9} + 15x^{8} + 35x^{7} + 60x^{6} + 77x^{5} + 55x^{4}$
$g_n(x)$

$g_1 = 1$

$g_2 = x + 2$

$g_3 = x^{3} + 3x^{2} + 6x + 5$

$g_4 = x^{6} + 4x^{5} + 10x^{4} + 20x^{3} + 28x^{2} + 28x + 14$

$g_5 = x^{10} + 5x^{9} + 15x^{8} + 35x^{7} + 70x^{6} + 117x^{5} + 165x^{4} + 195x^{3} + 180x^{2} + 120x + 42$

$g_6 = x^{15} + 6x^{14} + 21x^{13} + 56x^{12} + 126x^{11} + 252x^{10} + 451x^{9} + 726x^{8} + 1056x^{7} + 1386x^{6} + 1617x^{5} + 1650x^{4} + 1430x^{3} + 990x^{2} + 495x + 132$




8.(q) 


B, == q 

r<-¢ 

=_— ¢g + 2q° — ¢g 

B. = q a 3q° “+ ¢q oo 2q° — q° 


> 
uu 


B, = — gq’ + 4¢° — 3q° — 3g° + 2¢° + 2q" — g” 
B. = gq aa 5q’ + 6q" + 3q° an, 6q"° —_ 2q°° + 2q"* + 2q°° — gq: 
B, = «= q + 6q° ase 10g° a g° + 12q*° ar 3q°° + g** vate: 6q"* + 2q"* 


anni 3q"” + 2q°* + 2q”" a gq" 
B. _ q _— 7q° + 15q°° — 4q”* — 19q** + 12q** + q** + 9q"° — 3q°* — 6q"" 
+ 4q"* 5 as 6q"° + q’ 4. 2q”" ae 3q” + 2q** + 2q”" eo q*" 


n        |  1    2    3    4    5     6      7
S_n(1)   |  1    2   10   74  706  8162  109960
P_n(1)   |  1    1    4   27  248  2830   37782


REFERENCES

1. A. Errera, Un problème d'énumération, Mémoires publiés par l'Académie royale de Belgique, vol. 11 (Brussels, 1931).
2. E. Heine, Handbuch der Kugelfunktionen, 2nd ed., vol. 1 (Berlin, 1878).
3. O. Perron, Die Lehre von den Kettenbrüchen, 2nd ed. (Leipzig and Berlin, 1929).
4. S. Ramanujan, Collected papers (Cambridge, 1927).
5. J. Touchard, Sur un problème de configurations, C. R. de l'Académie des Sciences (Paris), June 1950.
6. J. Touchard, Contributions à l'étude du problème des timbres-poste, Can. J. Math., vol. 2 (1950), 385-398.
7. H. S. Wall, Analytic theory of continued fractions (New York, 1948).

Lausanne, Switzerland








ZETA FUNCTIONS ON THE UNITARY SPHERE 
S. MINAKSHISUNDARAM 


1. Introduction. In an earlier paper [5], the author defined a zeta function on the real sphere $x_1^{2} + x_2^{2} + \dots + x_{k+1}^{2} = 1$, whereas in the present paper it is proposed to define one on the unitary sphere $x_1\bar x_1 + x_2\bar x_2 + \dots + x_{k+1}\bar x_{k+1} = 1$, where the $x_i$ are complex numbers and the $\bar x_i$ their complex conjugates. Following E. Cartan, harmonics on the unitary sphere are defined and then a zeta function is formed just as in the case of a real sphere. The unitary sphere is seen to behave like an even-dimensional closed manifold, since results similar to the ones proved by the author and A. Pleijel [6] for closed manifolds (of even dimensions) are observed here also.

The zeta function on the real sphere may be viewed as a zeta function associated with the orthogonal group $O(m)$, while the one on the unitary sphere is associated with the unitary group $U(m)$. It is clear that one could give a suitable definition of a zeta function on any compact group [8]. If the group acts transitively on a closed manifold with a metric, one could use the idea of harmonics on this manifold [3] and define a zeta function. One could still define harmonics on groups [7] by taking the group itself as the manifold on which the group may act. But in all these cases one should be able to obtain these harmonics as eigenfunctions with associated eigenvalues. That this is possible was shown by Casimir [4]. According to him the elements of the unitary representations of a compact group are the eigenfunctions of a second order differential operator, now known as the Casimir operator. If the eigenvalues of the operator are $\lambda_n$ with the eigenfunctions $\phi_n$, then

$$\sum_{n} \frac{\phi_n(p)\,\phi_n(q)}{|\lambda_n|^{h}}$$

will be defined as a zeta function (if the spectrum of the operator is not discrete one will have to use suitable modifications of the definition). Since this series will converge for sufficiently large values of $R(h)$, one may study the analytic continuation of the function so defined.¹ While the discussion of the properties of such functions on abstract groups remains an open question, special cases lend themselves to easy treatment.


Received August 24, 1950. Work completed under contract with the Office of Naval Research.

¹If what Dr. I. Singer has communicated to me is true (that the Casimir operator for a compact Lie group is the Laplace-Beltrami operator), the results that Pleijel and I have proved carry over easily to this case. As pointed out by Hermann Weyl [7] the functional equation, if any, will have to be obtained.









2. Harmonics on the unitary sphere. We enumerate a few properties of harmonics on the unitary sphere $x_1\bar x_1 + x_2\bar x_2 + \dots + x_{k+1}\bar x_{k+1} = 1$. A full account of them is to be found in Cartan's book on projective geometry [2, chap. V]. Let $V = V(x_1, x_2, \dots, x_{k+1}; \bar x_1, \dots, \bar x_{k+1})$ be an integral polynomial in the $2k + 2$ variables $x_i, \bar x_i$ $(i = 1, \dots, k + 1)$, homogeneous and of degree $n$ in the variables $x_i$ and also homogeneous and of degree $n$ in the variables $\bar x_i$. This polynomial is said to be a harmonic of order $n$ if it satisfies

$$\Delta V \equiv \sum_{i=1}^{k+1} \frac{\partial^{2} V}{\partial x_i\,\partial \bar x_i} = 0.$$

There are only a finite number of linearly independent harmonics of order $n$, their number $k_n$ being given by

$$k_n = k(k + 2n)\left[\frac{(n+1)(n+2)\cdots(n+k-1)}{k!}\right]^{2}.$$


It is a characteristic property of these polynomials that any transformation effected on a point of the sphere (leading to another point of the sphere) changes a harmonic of order $n$ to a harmonic of order $n$ which is a linear combination of a basic set of $k_n$ linearly independent harmonics. If we choose the bases to be normal and orthogonal on the sphere and if they are denoted by $U_1^{n}(M), U_2^{n}(M), \dots, U_{k_n}^{n}(M)$, where $M$ is a point on the sphere, then the expression

$$U_1^{n}(M)U_1^{n}(M') + \dots + U_{k_n}^{n}(M)U_{k_n}^{n}(M'),$$

where $M'$ is another point on the sphere, is invariant for the group of transformations on the unitary sphere, viz., the unitary group. Further, the expression is a function of the geodesic distance between the points. In fact, if $r$ is the geodesic distance, it is a polynomial in $\cos 2r$, $L_n(\cos 2r)$ say, satisfying the differential equation

$$(1 - z^{2})L_n'' - \bigl((k+1)z + k - 1\bigr)L_n' + n(n+k)L_n = 0, \qquad z = \cos 2r.$$

We at once identify a polynomial solution of this equation as the Jacobi polynomial $P_n^{(k-1,0)}(z)$:

$$(1 - z)^{k-1}P_n^{(k-1,0)}(z) = \frac{(-1)^{n}}{2^{n}\,n!}\,\frac{d^{n}}{dz^{n}}\bigl\{(1 - z)^{n+k-1}(1 + z)^{n}\bigr\}.$$








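More precisely,

$$\sum_{i=1}^{k_n} U_i^{n}(M)U_i^{n}(M') = \frac{2n+k}{V}\,\frac{(n+k-1)!}{n!\,k!}\,P_n^{(k-1,0)}(\cos 2r) = \frac{2n+k}{kV}\,P_n^{(k-1,0)}(1)\,P_n^{(k-1,0)}(z),$$

where $V$ is the volume of the sphere and $z = \cos 2r$.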
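The identification with the Jacobi polynomial can be checked symbolically for small $n$ and $k$. The sketch below is not taken from the paper: it builds $P_n^{(k-1,0)}$ from the Rodrigues formula above using exact rational polynomial arithmetic, then evaluates the left-hand side of the differential equation and the value at $z = 1$. The helper names are hypothetical, and the closed form used in `dimension` assumes $k_n = \frac{2n+k}{k}\,[P_n^{(k-1,0)}(1)]^2$, which is what the addition formula gives at $M = M'$.

```python
from fractions import Fraction
from math import comb, factorial


def pmul(p, q):
    """Product of two polynomials (coefficient lists, lowest degree first)."""
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out


def ppow(p, e):
    out = [Fraction(1)]
    for _ in range(e):
        out = pmul(out, p)
    return out


def pderiv(p):
    return [i * c for i, c in enumerate(p)][1:] or [Fraction(0)]


def divide_by_one_minus_z(p):
    """Exact quotient p / (1 - z), by running sums of the coefficients."""
    q, carry = [], Fraction(0)
    for c in p[:-1]:
        carry += c
        q.append(carry)
    assert carry + p[-1] == 0, "p is not divisible by (1 - z)"
    return q


def jacobi(n, k):
    """Coefficients of P_n^{(k-1,0)}(z) from the Rodrigues formula."""
    one_minus = [Fraction(1), Fraction(-1)]
    one_plus = [Fraction(1), Fraction(1)]
    p = pmul(ppow(one_minus, n + k - 1), ppow(one_plus, n))
    for _ in range(n):
        p = pderiv(p)
    scale = Fraction((-1) ** n, 2 ** n * factorial(n))
    p = [scale * c for c in p]
    for _ in range(k - 1):          # strip the factor (1 - z)^(k-1)
        p = divide_by_one_minus_z(p)
    return p


def ode_residual(L, n, k):
    """Coefficients of (1 - z^2) L'' - ((k+1) z + k - 1) L' + n (n+k) L."""
    d1, d2 = pderiv(L), pderiv(pderiv(L))
    terms = [
        pmul([Fraction(1), Fraction(0), Fraction(-1)], d2),
        pmul([Fraction(-(k - 1)), Fraction(-(k + 1))], d1),
        [n * (n + k) * c for c in L],
    ]
    size = max(len(t) for t in terms)
    return [sum(t[i] for t in terms if i < len(t)) for i in range(size)]


def dimension(n, k):
    """k_n, assuming k_n = ((2n + k) / k) * P_n(1)^2 with P_n(1) = C(n+k-1, n)."""
    return (2 * n + k) * comb(n + k - 1, n) ** 2 // k


print(jacobi(1, 2), ode_residual(jacobi(2, 3), 2, 3), dimension(2, 2))
```

For $k = 1$ the polynomials reduce to the Legendre polynomials and `dimension(n, 1)` returns $2n + 1$.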










3. Zeta function on the unitary sphere. Inasmuch as we can associate the eigenvalue $n(n + k)$ with any harmonic of order $n$, we define a zeta function as the analytic continuation of the function represented by the Dirichlet series

$$\sum_{n=1}^{\infty} \frac{1}{n^{s}(n+k)^{s}} \sum_{i=1}^{k_n} U_i^{n}(M)U_i^{n}(M') = \frac{1}{kV}\sum_{n=1}^{\infty} \frac{(2n+k)\,P_n^{(k-1,0)}(1)\,P_n^{(k-1,0)}(z)}{n^{s}(n+k)^{s}}, \qquad s = \sigma + i\tau, \tag{1}$$

which has a half-plane of convergence, viz., $R(s) > k$.

We show that this function is an entire function of $s$ with simple zeros for negative integral values of $s$ provided $M \ne M'$, and a meromorphic function of $s$ with simple poles at $s = 1, 2, \dots, k$ if $M = M'$.

The proof proceeds along the same lines as in our earlier paper [5] and we briefly indicate it here. We need the following lemma [1].

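LEMMA.

$$\sum_{n=0}^{\infty} (2n+k)\,P_n^{(k-1,0)}(1)\,P_n^{(k-1,0)}(\cos 2r)\,t^{n} = \frac{k(1-t)}{(1+t)^{k+1}}\,F\Bigl(\frac{k+1}{2}, \frac{k+2}{2}; 1; \frac{4t\cos^{2} r}{(1+t)^{2}}\Bigr)$$

or

$$\sum_{n=0}^{\infty} (2n+k)\,P_n^{(k-1,0)}(1)\,P_n^{(k-1,0)}(\cos 2r)\,e^{-nt} = \frac{k}{2^{k}}\,e^{\frac12 kt}\,\frac{\sinh \frac12 t}{\cosh^{k+1} \frac12 t}\,F\Bigl(\frac{k+1}{2}, \frac{k+2}{2}; 1; \frac{\cos^{2} r}{\cosh^{2} \frac12 t}\Bigr). \tag{2}$$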
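The first form of the lemma can be spot-checked numerically. This is only a sketch: it assumes the reading of the lemma with argument $4t\cos^2 r/(1+t)^2$, evaluates the Jacobi polynomials from a standard finite-sum formula (not given in the paper), sums the Gauss series directly, and truncates the left-hand side after 60 terms, which is ample for the small value of $t$ used. All function names are hypothetical.

```python
from math import comb, cos


def jacobi(n, k, z):
    """P_n^{(k-1,0)}(z) from the explicit finite sum
    sum_s C(n+k-1, n-s) C(n, s) ((z-1)/2)^s ((z+1)/2)^(n-s)."""
    return sum(
        comb(n + k - 1, n - s) * comb(n, s)
        * ((z - 1) / 2) ** s * ((z + 1) / 2) ** (n - s)
        for s in range(n + 1)
    )


def hyp2f1_c1(a, b, x, terms=400):
    """Gauss series F(a, b; 1; x), valid for |x| < 1."""
    total, term = 0.0, 1.0
    for m in range(terms):
        total += term
        term *= (a + m) * (b + m) / ((m + 1) ** 2) * x
    return total


def lhs(k, r, t, terms=60):
    """Truncated series sum (2n+k) P_n(1) P_n(cos 2r) t^n."""
    z = cos(2 * r)
    return sum(
        (2 * n + k) * jacobi(n, k, 1.0) * jacobi(n, k, z) * t ** n
        for n in range(terms)
    )


def rhs(k, r, t):
    """Closed form k (1-t) / (1+t)^(k+1) * F((k+1)/2, (k+2)/2; 1; x)."""
    x = 4 * t * cos(r) ** 2 / (1 + t) ** 2
    return k * (1 - t) / (1 + t) ** (k + 1) * hyp2f1_c1((k + 1) / 2, (k + 2) / 2, x)


gap = abs(lhs(3, 0.7, 0.2) - rhs(3, 0.7, 0.2))
print(gap)
```

The finite-sum formula loses accuracy through cancellation when $n$ is large, so the check is meant for small $t$ only; a three-term recurrence would be the more robust choice for serious use.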
From the above lemma we obtain an integral representation of the Dirichlet’s 
series (1) as in [5, (15)]. 





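$$\sum_{n=1}^{\infty} \frac{(2n+k)\,P_n^{(k-1,0)}(1)\,P_n^{(k-1,0)}(\cos 2r)}{n^{s}(n+k)^{s}} = \Bigl(\frac{4}{k}\Bigr)^{s-\frac12}\frac{\Gamma(s+\frac12)}{\Gamma(2s)}\int_0^{\infty}\Bigl\{\frac{k\,\sinh \frac12 t}{2^{k}(\cosh \frac12 t)^{k+1}}\,F\Bigl(\frac{k+1}{2}, \frac{k+2}{2}; 1; \frac{\cos^{2} r}{\cosh^{2} \frac12 t}\Bigr) - k\,e^{-\frac12 kt}\Bigr\}\,t^{s-\frac12}\,I_{s-\frac12}(\tfrac12 kt)\,dt. \tag{3}$$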


If R(s) > k, the series on the left converges absolutely and is represented by the 
integral on the right which converges absolutely if R(s) >0. So the function 
represented by the series on the left can be continued up to R(s) = 0. We shall 
show, however, that it can be continued over the whole plane adopting the usual 
procedure of replacing the integral on the right by a contour integral. 

In the complex $t$ plane make a cut along the positive side of the real axis from the origin and take a contour $C$ from $\infty$ in the upper half of the plane, going round the origin in the anti-clockwise direction and then going back to $\infty$ in the lower half of the plane (poles of the integrand being in the exterior of $C$). Now consider the integral along $C$, viz.,


1 | f k_ sinh 3 (* +1 k+ cos’ r ) ion 
ee. ee ee 8 oe ear ee 
2a1J C lo (cosh $t) 2 2 cosh dt 


1 i f iw (ee—1) 1 [ 

, e — 

221 J @ + J + 2ni J, 

The second integral on the right, taken along a circle round the origin, is zero 
since the integrand is regular and hence for R(s) > 0, we have 


‘S| _ sin (2s — 1)¢ 4 
2midc T nae 
Thus 


(s) = (2n + k)P.**(1)P.-***(cos 2r) ee op r(s + $)T(i — 2s) 
—~ n*(n + k)* h 


for $R(s) > k$. The integral along $C$ is as in (4). Since the contour integral is finite for all values of $s$, it represents a regular function of $s$. Thus the series on the left represents an analytic function which can be continued throughout the plane with the possible exception of the simple poles of $\Gamma(1 - 2s)\Gamma(s + \frac12)$, viz., half odd integral and positive integral values of $s$. But the integral is seen to vanish for these values of $s$, since the integrand is then a regular function of $t$. To observe that the negative integral values of $s$ are the "trivial" zeros of the function, we have only to note that the residues of the integral are zero. We split the integral as the difference of two, viz.,


eee fats 223 cos" Fr 


’ ’ es ) — t -_ = — 1 bt dt 
C (cosh }t)*** 2 2 cosh’ 4% sais (— Bh) « 
and 


(— t)'*2,-4(— 4kt)dt = err? 








f 
wi Je 


f — pee -4 
Jee Oa Bd) dt. 


That the residues are zero in the first case follows from the fact that the integrand is an even function of $t$ for real $t$ when $s$ is a negative integer. The residues of the second integral are easily calculated to be zero, when $s$ is a negative integer, using the familiar formulas for $I_{s-\frac12}(\tfrac12 kt)$.

When the two points M and M’ coincide, i.e. when r = 0, we obtain 





5 (2m + kPe AYN" _ Pfs + ¥) PA — 25) 
athe n'(n + k)’ 271 


{k sinh 4} k+1 bk+2 1 om z 
Jigen os tL) ¢(—#)°*,_4(— 4ht)dt. 





2°’ 2’ " cosh* 4 













As in the previous case we observe that analytic continuation over the whole plane is possible with the possible exception of the simple poles of $\Gamma(s + \frac12)\Gamma(1 - 2s)$, viz., half odd integral and positive integral values of $s$. That part of the integral containing $e^{-\frac12 kt}$ gives no difficulty, since the integrand is regular for half integral values of $s$. The first part of the integrand is found to be an even function of $t$ for real $t$, when $s$ is half an odd integer, taking into account the singularity of the hypergeometric function at $t = 0$. Therefore the residues are zero. Thus half odd integral values do not contribute poles. Further, since we know that the function is regular for $R(s) > k$, we observe that the only poles are $s = 1, 2, \dots, k$.


REFERENCES

1. W. N. Bailey, Generalized hypergeometric series, Cambridge Tract No. 32 (1935).
2. E. Cartan, Leçons sur la géométrie projective complexe (Paris, 1931).
3. E. Cartan, Sur la détermination d'un système orthogonal complet dans un espace de Riemann symétrique clos, Rend. Circ. Mat. di Palermo, vol. 53 (1929), 217-252.
4. H. B. G. Casimir, Rotation of a rigid body in quantum mechanics, Leiden thesis (1931).
5. S. Minakshisundaram, Zeta function on the sphere, J. Ind. Math. Soc., vol. 13 (1949), 41-48.
6. S. Minakshisundaram and A. Pleijel, Some properties of the eigenfunction of the Laplace operator on Riemann manifolds, Can. J. Math., vol. 1 (1949), 242-286.
7. H. Weyl, Harmonics on homogeneous manifolds, Annals of Math., vol. 35 (1934), 485-499.
8. H. Weyl, Ramifications, old and new, of the eigenvalue problem, Bull. Amer. Math. Soc., vol. 56 (1950), 115-139.

Waltair, S. India


THE HOMOMORPHIC MAPPING OF CERTAIN MATRIC 
ALGEBRAS ONTO RINGS OF DIAGONAL MATRICES 


J. K. GOLDHABER 


1. Introduction. The problem of determining the conditions under which a finite set of matrices $A_1, A_2, \dots, A_k$ has the property that their characteristic roots $\lambda_{1j}, \lambda_{2j}, \dots, \lambda_{kj}$ $(j = 1, 2, \dots, n)$ may be so ordered that every polynomial $f(A_1, A_2, \dots, A_k)$ in these matrices has characteristic roots $f(\lambda_{1j}, \lambda_{2j}, \dots, \lambda_{kj})$ $(j = 1, 2, \dots, n)$ was first considered by Frobenius [4]. He showed that a sufficient condition for the $(A_i)$ to have this property is that they be commutative. It may be shown by an example that this condition is not necessary.

J. Williamson [9] considered this problem for two matrices under the restric- 
tion that one of them be non-derogatory. He then showed that a necessary and 
sufficient condition that these two matrices have the above property is that they 
satisfy a certain finite set of matric equations. 

N. H. McCoy [7] showed that a necessary and sufficient condition that $A_1, A_2, \dots, A_k$ have the above property is that $A_rA_s - A_sA_r$ $(r, s = 1, 2, \dots, k)$ belong to the radical of the algebra generated by the $(A_i)$. It may be noted that while on the one hand McCoy's condition removes the restriction that one of the matrices be non-derogatory, it does not, on the other hand, give a criterion, such as the Williamson condition, which may be easily computed.

In a part of the following investigation it is proved that if $\mathfrak{A}$ is a matric algebra such that the sum of every two matrices of $\mathfrak{A}$ has characteristic roots which are the sums of the characteristic roots of the two matrices, then every finite set of matrices of $\mathfrak{A}$ has the above property. This is a small step forward in an attempt to recover the computability of the Williamson condition.

The following mapping theorem, which is used in the proof of the above theorem, is also proved. Let $\mathfrak{A}$ be an algebra over an algebraically closed field $\mathfrak{F}$. Let $\mathfrak{B}$ be an algebra over $\mathfrak{F}$. Suppose given a mapping of $\mathfrak{A}$ onto $\mathfrak{B}$ which (1) maps the identity of $\mathfrak{A}$, if any, onto the identity of $\mathfrak{B}$, (2) is linear, and (3) maps zero divisors into zero divisors in a strong sense. Then this mapping is a homomorphism of $\mathfrak{A}$ onto $\mathfrak{B}$ modulo its radical.

Also included in this investigation is a proof of the McCoy condition which is 
somewhat simpler and more direct than the one originally given by McCoy. 

The author wishes to thank the referee for his many helpful suggestions, and 
in particular for his suggested proofs of Lemma 3.1 and Theorems 4.2 and 5.1. 


2. Some known results on the structure of algebras. All the theorems of this section either appear in [1], or are immediate consequences of theorems which appear there. Throughout the discussions of this and subsequent sections $\mathfrak{F}$ shall denote an arbitrary algebraically closed field.

Received August 28, 1950.


THEOREM 2.1 [1, p. 14]. If $\mathfrak{D}$ is a division algebra over $\mathfrak{F}$, then $\mathfrak{D} = \mathfrak{F}$.


THEOREM 2.2 [1, p. 44]. If $\mathfrak{A}$ is a semi-simple algebra over $\mathfrak{F}$, then $\mathfrak{A}$ is separable over $\mathfrak{F}$.


THEOREM 2.3 [1, p. 39]. If $\mathfrak{A}$ is a simple algebra over $\mathfrak{F}$, then $\mathfrak{A}$ is a total matric algebra over $\mathfrak{F}$.


THEOREM 2.4 [1, p. 39]. If $\mathfrak{A}$ is a semi-simple algebra over $\mathfrak{F}$, then either $\mathfrak{A}$ is a total matric algebra over $\mathfrak{F}$ or $\mathfrak{A}$ is expressible as the direct sum of total matric algebras over $\mathfrak{F}$.


THEOREM 2.5. If $\mathfrak{A}$ is an algebra over $\mathfrak{F}$, then

$$\mathfrak{A} = (\mathfrak{M}_1 \oplus \mathfrak{M}_2 \oplus \dots \oplus \mathfrak{M}_t) + \mathfrak{N},$$

where the $\mathfrak{M}_i$ are total matric algebras over $\mathfrak{F}$ and where $\mathfrak{N}$ is the radical of $\mathfrak{A}$. (The symbol $\oplus$ denotes direct sum and the symbol $+$ denotes supplementary sum.)


THEOREM 2.6 [1, p. 40]. A commutative semi-simple algebra is a direct sum of fields.


THEOREM 2.7 [1, p. 44]. Let $\mathfrak{A}$ be an algebra over $\mathfrak{K}$. Then there exists an algebraic extension $\mathfrak{K}'$ of $\mathfrak{K}$ such that $\mathfrak{A}_{\mathfrak{K}'}$ is a diagonal algebra if and only if $\mathfrak{A}$ is a direct sum of separable fields.


THEOREM 2.8. If $\mathfrak{A}$ is a commutative semi-simple algebra over an algebraically closed field, then $\mathfrak{A}$ is isomorphic to a diagonal algebra.


3. Theorems of Frobenius and McCoy. 


THEOREM 3.1 [4]. Let $A_i$ $(i = 1, 2, \dots, k)$ be a set of commutative matrices. Let $f(x_1, x_2, \dots, x_k)$ be any polynomial with coefficients in $\mathfrak{F}$. The characteristic roots of $A_i$, $\lambda_{ij}$ $(j = 1, 2, \dots, n)$, may be so ordered that the characteristic roots of $f(A_1, A_2, \dots, A_k)$ are $f(\lambda_{1j}, \lambda_{2j}, \dots, \lambda_{kj})$. This ordering is the same for every $f$.


Every finite set of matrices $(A_i)$, commutative or otherwise, which enjoys the property of the preceding theorem will be said to have the Frobenius Property.


THEOREM 3.2 [7]. Let $(A_i)_{i=1}^{k}$ be an arbitrary set of matrices all of the same order. Let $\mathfrak{R} = \mathfrak{R}[A_1, A_2, \dots, A_k]$ denote the algebra of all polynomials in the $A_i$. Let $\mathfrak{N}$ denote the radical of $\mathfrak{R}$. A necessary and sufficient condition that $(A_i)$ have the Frobenius Property is that $A_rA_s - A_sA_r \in \mathfrak{N}$ $(r, s = 1, 2, \dots, k)$.


Before proceeding to prove these theorems we shall indicate the mode of
approach. If the set of matrices $(A_i)$ satisfies the Frobenius condition of
commutativity or the McCoy condition (i.e., that $A_rA_s - A_sA_r \in \mathfrak{N}$) then by
Theorem 2.8 and Wedderburn's Principal Theorem it follows that the algebra
$\mathfrak{R}[A_1, A_2, \dots, A_k]$ is homomorphic to a diagonal algebra, the kernel of the
homomorphism being the radical of $\mathfrak{R}$. Every diagonal algebra clearly has the
Frobenius Property. Therefore, if it is shown that the elements of the radical
under the operation of addition do not affect the characteristic roots of the
elements of the algebra, then both of the above theorems will follow readily.


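LEMMA 3.1. Let $\mathfrak{A}$ be a matric algebra over $\mathfrak{F}$. Let $\mathfrak{N}$ be the radical of $\mathfrak{A}$.
Suppose that the identity matrix $I \in \mathfrak{A}$. If $A \in \mathfrak{A}$ and $N \in \mathfrak{N}$, then $A$ and
$A + N$ have the same characteristic function.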


Let $z$ be an indeterminate, and $I = A_0$ the unit matrix. Following [3], define
matrices $A_k$ and constants $c_k$ recursively as follows:

$$c_0 = 1, \qquad c_k = -\frac{1}{k}\,\operatorname{tr}(A A_{k-1}), \qquad A_k = A A_{k-1} + c_k I.$$

Then we have [3]:

$$P(z, A) = \sum_{k=0}^{n-1} A_k z^{\,n-1-k}, \qquad \det(zI - A) = \sum_{k=0}^{n} c_k z^{\,n-k},$$

where $P(z, A)$ is the adjoint polynomial of $zI - A$.
Now if $A$ is replaced by $A + N$, with $N$ in the radical, the new $(A + N)_k$
differ from the old $A_k$ by elements of the radical, whose trace is zero. Hence
the constants $c_k$ are the same for both $A$ and $A + N$, and

$$\det(zI - A) = \det(zI - A - N).$$
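The recursion used in this proof can be tried numerically. The following is a minimal sketch, not part of the original paper: it assumes exact rational arithmetic via Python's `fractions` module, and the matrices `A` and `N` are hypothetical illustrative data. They are taken from the algebra of upper triangular matrices of order 3, whose radical consists of the strictly upper triangular matrices, so `N` lies in the radical.

```python
from fractions import Fraction


def mat_mul(X, Y):
    """Product of two square matrices stored as lists of rows."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def char_coeffs(A):
    """Return [c_0, ..., c_n] with det(zI - A) = sum_k c_k z^(n-k)."""
    n = len(A)
    A = [[Fraction(x) for x in row] for row in A]
    identity = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    A_prev = identity            # A_0 = I
    coeffs = [Fraction(1)]       # c_0 = 1
    for k in range(1, n + 1):
        AA = mat_mul(A, A_prev)                       # A * A_{k-1}
        c_k = -sum(AA[i][i] for i in range(n)) / k    # c_k = -(1/k) tr(A A_{k-1})
        coeffs.append(c_k)
        # A_k = A * A_{k-1} + c_k * I
        A_prev = [[AA[i][j] + c_k * identity[i][j] for j in range(n)]
                  for i in range(n)]
    return coeffs


# Hypothetical data: A is upper triangular, N is strictly upper triangular.
A = [[2, 1, 0], [0, 3, 5], [0, 0, 7]]
N = [[0, 4, -1], [0, 0, 6], [0, 0, 0]]
A_plus_N = [[A[i][j] + N[i][j] for j in range(3)] for i in range(3)]

print(char_coeffs(A))
print(char_coeffs(A_plus_N))
```

Both calls return the same coefficient list, which is the statement of Lemma 3.1 for this particular pair.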


LEMMA 3.2. Let $\mathfrak{A}$ be a semi-simple commutative algebra. Let $(A_i)_{i=1}^{k}$ be a set
of matrices with $A_i \in \mathfrak{A}$. Then $(A_i)_{i=1}^{k}$ has the Frobenius Property.


By Theorem 2.8 $\mathfrak{A}$ is isomorphic to a diagonal algebra. But clearly any finite
set of elements of a diagonal algebra has the Frobenius Property. Hence,
because of the existing isomorphism, so does $(A_i)_{i=1}^{k}$.

The proof of the sufficiency part of Theorem 3.2, from which Theorem 3.1
follows, may now be given. From Theorem 2.2 and Wedderburn's Principal
Theorem it follows that $\mathfrak{R} = \mathfrak{R}' + \mathfrak{N}$ where $\mathfrak{R}' \cong \mathfrak{R} - \mathfrak{N}$. Since
$A_rA_s - A_sA_r \in \mathfrak{N}$, it follows that $\mathfrak{R}'$ is a commutative semi-simple algebra. Thus

$$A_i = A_i' + N_i, \qquad A_i' \in \mathfrak{R}', \quad N_i \in \mathfrak{N}.$$

By Lemma 3.1 the characteristic roots of $A_i$ are the same as those of $A_i'$. By
Lemma 3.2 there exists a unique ordering of the roots, $\lambda_{ij}$, of the $A_i'$ such that
for every polynomial $f(x_1, x_2, \dots, x_k)$ the characteristic roots of $f(A_1', A_2', \dots, A_k')$
are $f(\lambda_{1j}, \lambda_{2j}, \dots, \lambda_{kj})$ $(j = 1, 2, \dots, n)$. Note now that

$$f(A_1', A_2', \dots, A_k') = f(A_1 - N_1, A_2 - N_2, \dots, A_k - N_k) = f(A_1, A_2, \dots, A_k) + N, \qquad N \in \mathfrak{N}.$$

Again by Lemma 3.1 the characteristic roots of $f(A_1, A_2, \dots, A_k)$ are
$f(\lambda_{1j}, \lambda_{2j}, \dots, \lambda_{kj})$. Hence the sufficiency of the stated condition has been shown.













The proof of the necessity of the condition of Theorem 3.2 is immediate. For
if an ordering of the roots exists then clearly all the roots of

$$f(A_1, A_2, \dots, A_k) \cdot [A_rA_s - A_sA_r]$$

are zero for every $f(A_1, A_2, \dots, A_k) \in \mathfrak{R}$, so that $A_rA_s - A_sA_r$ is properly
nilpotent in $\mathfrak{R}$ and hence is in $\mathfrak{N}$.
It may be interesting to state Theorem 3.2 in the following equivalent form: 


THEOREM 3.2a. A necessary and sufficient condition that a set of matrices
$(A_i)_{i=1}^{k}$ have the Frobenius Property is that there exists a homomorphism of the
algebra $\mathfrak{R} = \mathfrak{R}[A_1, A_2, \dots, A_k]$, with kernel the radical of $\mathfrak{R}$, onto a diagonal
algebra.


4. Concerning characteristic vectors. Rademacher [8] proved Frobenius’s 
Theorem, our Theorem 3.1, by first proving: 


THEOREM 4.1. Let $(A_i)_{i=1}^{k}$ be a set of commutative matrices. Then there exists
a set of numbers $(\mu_i)_{i=1}^{k}$ and a row vector $\psi$ such that

$$\psi A_i = \mu_i \psi \qquad (i = 1, 2, \dots, k).$$

A row vector $\psi \neq 0$ which has the property that $\psi A_i = \mu_i \psi$ $(i = 1, 2, \dots, k)$
is called a characteristic row vector associated with the set $(A_i)_{i=1}^{k}$. A
characteristic column vector associated with $(A_i)_{i=1}^{k}$ may be defined similarly.
A more general form of the above theorem is given in: 


THEOREM 4.2. Suppose that $A_rA_s - A_sA_r$ $(r, s = 1, 2, \dots, k)$ is in the radical,
$\mathfrak{N}$, of $\mathfrak{R} = \mathfrak{R}[A_1, A_2, \dots, A_k]$. Let $n_c$ $(n_r)$ denote the nullity of the column
(row) space of $\mathfrak{N}$. Then there are exactly $n_c$ $(n_r)$ linearly independent
characteristic row (column) vectors associated with $(A_i)_{i=1}^{k}$.

As above, $\mathfrak{R} = \mathfrak{R}' + \mathfrak{N}$ where $\mathfrak{R}' \cong \mathfrak{R} - \mathfrak{N}$ and where $\mathfrak{R}'$ is a commutative
semi-simple algebra. By Theorem 2.8 it may be assumed without loss of
generality that $\mathfrak{R}'$ is a diagonal algebra.

Since the nullity of the column space of $\mathfrak{N}$ is $n_c$, there exists a matrix $H$ of
rank $n_c$ such that $HN = 0$ for every $N \in \mathfrak{N}$. Clearly the row vectors of $H$
form a basis for the complement of the column space of $\mathfrak{N}$; that is, if $\phi$ is a row
vector such that $\phi N = 0$ for every $N \in \mathfrak{N}$, then $\phi$ is a linear combination of the
row vectors of $H$; for otherwise the nullity of the column space of $\mathfrak{N}$ would be
greater than $n_c$.

A matrix is in Hermite form if it is "triangular with zeros above the diagonal;
with every diagonal element either zero or one; if the diagonal element in any
row is zero, the entire row is zero; if the diagonal element in any column is one,
every other element of the column is zero" [6, p. 35].

It may be assumed that $H$ is in Hermite form; for otherwise one may multiply
$H$ on the left by a non-singular matrix $P$ which brings $H$ into Hermite form [6,
p. 35] and then $(PH)N = H'N = 0$ for every $N \in \mathfrak{N}$. It will be shown that
each of the $n_c$ non-zero row vectors of $H$ is a characteristic row vector.







Now $A_i = D_i + N_i$ $(i = 1, 2, \dots, k)$, where $D_i$ is diagonal and $N_i \in \mathfrak{N}$.
Note also that since $D_iN \in \mathfrak{N}$ for every $N \in \mathfrak{N}$ it follows that $(HD_i)N =
H(D_iN) = 0$ for every $N \in \mathfrak{N}$; and hence it is true that every row vector of
$HD_i$ is a linear combination of the row vectors of $H$. We may therefore write
$HD_i = LH$. If $z_i$ is a number such that the diagonal matrix $B_i = z_iI + D_i$ is
non-singular, then

$$HB_i = (z_iI + L)H,$$
$$B_i^{-1}HB_i = B_i^{-1}(z_iI + L)H.$$

The matrix $B_i^{-1}HB_i$ on the left is in Hermite form, since this form is still retained
after transforming $H$ by a non-singular diagonal matrix. Since the Hermite
form is unique, the right member, which is a left multiple of $H$, can be in Hermite
form only if it is equal to $H$. Hence $B_i$ and, consequently, $D_i$ are commutative
with $H$. From the equation

$$HA_i = HD_i = D_iH$$

it follows that if $\psi_k$ is a non-vanishing row vector occupying the $k$th row of $H$
and $\lambda_{ik}$ is the $k$th diagonal element of $D_i$, then

$$\psi_k A_i = \lambda_{ik}\psi_k.$$

Thus the $n_c$ linearly independent vectors $\psi_k$ are characteristic vectors of each
of the matrices $A_i$.
Suppose now that $\psi$ is any characteristic row vector associated with $(A_i)_{i=1}^{k}$:
$\psi A_i = \lambda_i\psi$. Let $N$ be any element of $\mathfrak{N}$. Since $N \in \mathfrak{R}[A_1, A_2, \dots, A_k]$ it is
true that $N = f(A_1, A_2, \dots, A_k)$. Thus

$$\psi N = \psi f(A_1, A_2, \dots, A_k) = f(\lambda_1, \lambda_2, \dots, \lambda_k)\,\psi.$$

But since $N \in \mathfrak{N}$, $N$ has only zero as a characteristic root, and hence
$f(\lambda_1, \lambda_2, \dots, \lambda_k) = 0$. Therefore $\psi$ annihilates every element of $\mathfrak{N}$. But this
means that $\psi$ is a linear combination of the $n_c$ vectors $\psi_k$ considered above. This
completes the proof of the theorem.


In the example given below $n_c \neq n_r$. This indicates that one cannot in
general expect to get an expression for $n_c$ or $n_r$ in terms of the Weyr or Segre
characteristics of the matrices involved; for the latter invariants do not
differentiate between the structure of the row spaces and the column spaces of the
matrices.














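Example:

$$A_1 = \begin{bmatrix} 1 & 0 & 0 \\ 1 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}, \qquad A_2 = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 1 & 0 & 1 \end{bmatrix},$$

$$N_1 = \begin{bmatrix} 0 & 0 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \end{bmatrix}, \qquad N_2 = \begin{bmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ 1 & 0 & 0 \end{bmatrix},$$

$$\mathfrak{R} = \mathfrak{R}[A_1, A_2], \qquad \mathfrak{R}' = \mathfrak{R}[I], \qquad \mathfrak{N} = \mathfrak{R}[N_1, N_2],$$
$$A_1 = I + N_1, \qquad A_2 = I + N_2.$$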
















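Thus $n_c = 1$, $n_r = 2$. Also note that

$$[1, 0, 0] \cdot A_i = 1 \cdot [1, 0, 0] \qquad (i = 1, 2),$$

$$A_i \cdot \begin{bmatrix} 0 \\ 1 \\ 0 \end{bmatrix} = 1 \cdot \begin{bmatrix} 0 \\ 1 \\ 0 \end{bmatrix}, \qquad A_i \cdot \begin{bmatrix} 0 \\ 0 \\ 1 \end{bmatrix} = 1 \cdot \begin{bmatrix} 0 \\ 0 \\ 1 \end{bmatrix} \qquad (i = 1, 2).$$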
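The example can be checked mechanically. The sketch below is an illustration only, with helper names (`row_times`, `times_col`) chosen here rather than taken from the paper. It tests only the three unit vectors; that is enough in this instance because the vectors annihilated by $N_1$ and $N_2$ happen to form coordinate subspaces, but it is not a general nullity computation.

```python
def row_times(v, M):
    """Row vector v times matrix M."""
    return [sum(v[i] * M[i][j] for i in range(3)) for j in range(3)]


def times_col(M, v):
    """Matrix M times column vector v."""
    return [sum(M[i][j] * v[j] for j in range(3)) for i in range(3)]


A1 = [[1, 0, 0], [1, 1, 0], [0, 0, 1]]
A2 = [[1, 0, 0], [0, 1, 0], [1, 0, 1]]
N1 = [[0, 0, 0], [1, 0, 0], [0, 0, 0]]
N2 = [[0, 0, 0], [0, 0, 0], [1, 0, 0]]
ZERO = [0, 0, 0]
UNIT_VECTORS = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]

# Unit row vectors e with e*N1 = e*N2 = 0, and unit column vectors
# e with N1*e = N2*e = 0.
row_annihilators = [e for e in UNIT_VECTORS
                    if row_times(e, N1) == ZERO and row_times(e, N2) == ZERO]
col_annihilators = [e for e in UNIT_VECTORS
                    if times_col(N1, e) == ZERO and times_col(N2, e) == ZERO]

# Each annihilator should be a characteristic vector of A1 and A2 with root 1.
rows_ok = all(row_times(e, A) == e for e in row_annihilators for A in (A1, A2))
cols_ok = all(times_col(A, e) == e for e in col_annihilators for A in (A1, A2))

print(row_annihilators, col_annihilators, rows_ok, cols_ok)
```

One row vector and two column vectors survive, in agreement with the counts stated in the example.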


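5. A mapping theorem. $\mathfrak{X}$ is said to be a module over $\mathfrak{F}$ if $\mathfrak{X}$ is a linear subset
of an algebra over $\mathfrak{F}$.

Let $\mathfrak{X}$ and $\mathfrak{Y}$ be modules over $\mathfrak{F}$. Let $\Phi$ be a mapping of $\mathfrak{X}$ onto $\mathfrak{Y}$ which
satisfies the following conditions C (cited below as $C_1$, $C_2$, $C_3$):


(1) If $\mathfrak{X}$ has a unit $e$, then $\mathfrak{Y}$ has a unit $e'$, and $\Phi(e) = e'$.


(2) $\Phi$ is linear, i.e., if $X_i \in \mathfrak{X}$ and $a_i \in \mathfrak{F}$, then $\Phi\left(\sum_{i=1}^{k} a_iX_i\right) = \sum_{i=1}^{k} a_i\Phi(X_i)$.


(3) If $X_i \in \mathfrak{X}$ and if $\prod_{i=1}^{k} X_i = 0$, then $\prod_{i=1}^{k} \Phi(X_i) = 0$.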


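THEOREM 5.1. Let $\mathfrak{M}$ be a total matric algebra over $\mathfrak{F}$, $\mathfrak{Y}$ a module over $\mathfrak{F}$, and
$\Phi$ a mapping of $\mathfrak{M}$ onto $\mathfrak{Y}$ which satisfies conditions C. If $A, A' \in \mathfrak{M}$ then
$\Phi(A \cdot A') = \Phi(A)\Phi(A')$. Thus $\mathfrak{Y}$ is an algebra and $\Phi$ maps $\mathfrak{M}$ homomorphically
onto $\mathfrak{Y}$.

$\mathfrak{M}$ has a basis $E_{ij}$ $(i, j = 1, 2, \dots, n)$ where $E_{ij}E_{km} = \delta_{jk}E_{im}$ and where $\delta_{jk}$
is Kronecker's delta.


Since $\Phi$ is linear it will be sufficient to show that

$$\Phi(E_{ij}E_{km}) = \Phi(E_{ij})\Phi(E_{km}) = \delta_{jk}\Phi(E_{im}).$$

If $I$ is the unit matrix, each of the following products vanishes:

$$E_{ij}E_{km} = 0 \quad \text{for } j \neq k,$$
$$(E_{ii} - I)E_{im} = E_{ii}E_{im} - IE_{im} = 0,$$
$$(E_{ii} - E_{ij})(E_{im} + E_{jm}) = E_{ii}E_{im} + E_{ii}E_{jm} - E_{ij}E_{im} - E_{ij}E_{jm} = 0.$$

Hence the image under the mapping $\Phi$ of each of these products vanishes. We
obtain successively:

$$\Phi(E_{ij})\Phi(E_{km}) = 0 \quad \text{for } j \neq k,$$
$$\Phi(E_{ii})\Phi(E_{im}) = \Phi(I)\Phi(E_{im}) = \Phi(E_{im}),$$
$$\Phi(E_{ij})\Phi(E_{jm}) = \Phi(E_{ii})\Phi(E_{im}) + \Phi(E_{ii})\Phi(E_{jm}) - \Phi(E_{ij})\Phi(E_{im}) = \Phi(E_{im}).$$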
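The three vanishing products on which this proof rests are identities among matrix units, and they can be confirmed by direct computation. The following sketch is an illustration only; the function names are chosen here and indices run from 0 rather than 1. It checks the identities for every index combination of a given order.

```python
from itertools import product


def unit(n, i, j):
    """Matrix unit E_ij of order n: 1 in row i, column j, zeros elsewhere."""
    return [[int((r, c) == (i, j)) for c in range(n)] for r in range(n)]


def mul(X, Y):
    n = len(X)
    return [[sum(X[r][k] * Y[k][c] for k in range(n)) for c in range(n)]
            for r in range(n)]


def add(X, Y, sign=1):
    """Return X + sign * Y."""
    n = len(X)
    return [[X[r][c] + sign * Y[r][c] for c in range(n)] for r in range(n)]


def vanishing_products_hold(n):
    zero = [[0] * n for _ in range(n)]
    identity = [[int(r == c) for c in range(n)] for r in range(n)]
    # E_ij E_km = 0 for j != k
    for i, j, k, m in product(range(n), repeat=4):
        if j != k and mul(unit(n, i, j), unit(n, k, m)) != zero:
            return False
    # (E_ii - I) E_im = 0
    for i, m in product(range(n), repeat=2):
        if mul(add(unit(n, i, i), identity, -1), unit(n, i, m)) != zero:
            return False
    # (E_ii - E_ij)(E_im + E_jm) = 0
    for i, j, m in product(range(n), repeat=3):
        left = add(unit(n, i, i), unit(n, i, j), -1)
        right = add(unit(n, i, m), unit(n, j, m))
        if mul(left, right) != zero:
            return False
    return True


print(vanishing_products_hold(3))
```

Condition $C_3$ then transfers each vanishing product to the images under $\Phi$, which is all the proof requires.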

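THEOREM 5.2. Let $\mathfrak{A}$ and $\mathfrak{B}$ be algebras over $\mathfrak{F}$ with radicals $\mathfrak{N}$ and $\mathfrak{N}'$
respectively. Let $\Phi$ be a mapping of $\mathfrak{A}$ onto $\mathfrak{B}$ which satisfies conditions C. If $A, A' \in \mathfrak{A}$
then $\Phi(A \cdot A') \equiv \Phi(A)\Phi(A') \bmod \mathfrak{N}'$.

By Theorem 2.5, $\mathfrak{A} = \mathfrak{S} + \mathfrak{N}$ where $\mathfrak{S} = \mathfrak{M}_1 \oplus \mathfrak{M}_2 \oplus \dots \oplus \mathfrak{M}_t$. Thus if
$A \in \mathfrak{A}$ then $A$ is uniquely expressible as $A = S + N$, where $S \in \mathfrak{S}$ and $N \in \mathfrak{N}$.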




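(1) If $N \in \mathfrak{N}$, then $\Phi(N) \in \mathfrak{N}'$. For suppose that $N \in \mathfrak{N}$. It is sufficient
to show that if $B \in \mathfrak{B}$, then $[B\Phi(N)]^l = 0$ for some positive integer $l$. Since
$\Phi$ maps $\mathfrak{A}$ onto $\mathfrak{B}$ it follows that if $B \in \mathfrak{B}$ then there exists an $A \in \mathfrak{A}$ such that
$B = \Phi(A)$. Since $N \in \mathfrak{N}$ it is true that there exists an $l$ such that

$$[AN]^l = ANAN \cdots AN = 0.$$

By $C_3$, $\Phi(A)\Phi(N) \cdots \Phi(A)\Phi(N) = 0$, or $[\Phi(A)\Phi(N)]^l = [B\Phi(N)]^l = 0$.
Therefore $\Phi(N) \in \mathfrak{N}'$.


(2) With the use of Theorem 5.1 it may be shown quite easily that if $S, S' \in \mathfrak{S}$,
then $\Phi(S)\Phi(S') = \Phi(SS')$.


(3) Suppose now that $A, A' \in \mathfrak{A}$. Then

$$A = S + N, \qquad A' = S' + N', \qquad S, S' \in \mathfrak{S}, \quad N, N' \in \mathfrak{N},$$
$$AA' = S'' + N'' = (A - N)(A' - N') + N'', \qquad S'', (A - N), (A' - N') \in \mathfrak{S}, \quad N'' \in \mathfrak{N}.$$

Using (1) and (2) we may write

$$\Phi(AA') = \Phi(A - N)\Phi(A' - N') + \Phi(N''),$$
$$\Phi(A)\Phi(A') = [\Phi(A - N) + \Phi(N)][\Phi(A' - N') + \Phi(N')],$$
$$\Phi(AA') - \Phi(A)\Phi(A') = \Phi(N'') - \Phi(N)\Phi(A' - N') - \Phi(A - N)\Phi(N') - \Phi(N)\Phi(N') \equiv 0 \bmod \mathfrak{N}'.$$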


In the preceding theorem it has been assumed that the field $\mathfrak{F}$ was
algebraically closed. The following example shows that Theorem 5.2 is not necessarily
true if the field is not algebraically closed. Thus some condition on $\mathfrak{F}$ is
necessary. It may be proved that the theorem still holds if the condition of algebraic
closure is replaced by the somewhat weaker condition that the characteristic
roots of every element of $\mathfrak{A}$ all lie in $\mathfrak{F}$.

Let Ra denote the rational field and let $\mathfrak{A}$ be an algebra over Ra with basis
elements $I$ and $A$, where $I$ is the identity and $A^2 = -I$. Define a mapping $\Phi$
of $\mathfrak{A}$ onto $\mathfrak{A}$ as follows:

$$\Phi(aI + bA) = (a + b)I + bA, \qquad a, b \in \mathrm{Ra}.$$

Clearly $\Phi(I) = I$; $\Phi$ is linear; and since $\mathfrak{A}$ has no proper zero divisors, $\Phi$
satisfies $C_3$ vacuously. The radical of $\mathfrak{A}$ is zero. Note however that

$$\Phi(A^2) = \Phi(-I) = -I \neq \Phi(A)\Phi(A) = 2A.$$

Note also that if the complex field were used instead of the rational field, and $\Phi$
defined similarly, then condition $C_3$ would not be satisfied. For

$$(iI + A)(iI - A) = 0 \qquad (i^2 = -1),$$

whereas

$$\Phi(iI + A)\Phi(iI - A) = [(i + 1)I + A][(i - 1)I - A] = -I - 2A.$$
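The arithmetic of this counter-example is easy to replay. In the sketch below, which is an illustration and not part of the paper, an element $aI + bA$ is stored as the pair `(a, b)`, and the product uses only the relation $A^2 = -I$. The names `mul` and `phi` are chosen here; Python's built-in complex numbers stand in for the complex coefficients of the second part.

```python
def mul(u, v):
    """(aI + bA)(cI + dA) = (ac - bd)I + (ad + bc)A, using A*A = -I."""
    a, b = u
    c, d = v
    return (a * c - b * d, a * d + b * c)


def phi(u):
    """The mapping aI + bA -> (a + b)I + bA of the example."""
    a, b = u
    return (a + b, b)


A = (0, 1)  # the basis element A

# Rational coefficients: phi(A*A) differs from phi(A)*phi(A).
image_of_square = phi(mul(A, A))        # phi(-I)
square_of_image = mul(phi(A), phi(A))   # (I + A)(I + A)

# Complex coefficients: a zero product whose image is not zero.
u = (1j, 1)    # iI + A
v = (1j, -1)   # iI - A
zero_product = mul(u, v)
image_product = mul(phi(u), phi(v))

print(image_of_square, square_of_image, zero_product, image_product)
```

The pairs `(-1, 0)` and `(0, 2)` correspond to $-I$ and $2A$, and the pair `(-1, -2)` to $-I - 2A$.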
















6. The assignment of a common order to the characteristic roots of certain
sets of matrices. Let $\mathfrak{A}$ be a subalgebra of a total matric algebra $\mathfrak{M}$ of order $n^2$
over an algebraically closed field $\mathfrak{F}$. Suppose that the identity matrix $I$ is in $\mathfrak{A}$.
Let $\lambda_{ij}$ $(j = 1, 2, \dots, n)$ denote the characteristic roots of $A_i \in \mathfrak{A}$.

$\mathfrak{A}$ is said to have property $P_1$ if the characteristic roots of every pair of matrices
$A_r, A_s \in \mathfrak{A}$ may be so ordered that the characteristic roots of $A_r + A_s$ are
$\lambda_{rj} + \lambda_{sj}$ $(j = 1, 2, \dots, n)$.


$\mathfrak{A}$ is said to have property $P_2$ if the characteristic roots of every finite set of
matrices $(A_i)_{i=1}^{k} \in \mathfrak{A}$ may be so ordered that the characteristic roots of $\sum_{i=1}^{k} a_iA_i$
are $\sum_{i=1}^{k} a_i\lambda_{ij}$ $(j = 1, 2, \dots, n)$ for all $a_i \in \mathfrak{F}$.


The ordering of the roots in property $P_1$ is not assumed to be unique. It is
conceivable, a priori, that the $j$th characteristic root of $A_r + A_s$ is $\lambda_{rj} + \lambda_{sj}$
but that for some $a \in \mathfrak{F}$ the $j$th characteristic root of $A_r + aA_s$ is $\lambda_{rj} + a\lambda_{sk}$;
thus it seems possible that the $j$th root of $A_r$ may associate with the $j$th root of
$A_s$ but that for some $a \in \mathfrak{F}$ the $j$th root of $A_r$ will associate with the $k$th root
of $aA_s$. That this is not so is proved in


LEMMA 6.1. Suppose that $\mathfrak{A}$ has property $P_1$. Suppose that the $j$th
characteristic root of $A_r + A_s$ is $\lambda_{rj} + \lambda_{sj}$ $(j = 1, 2, \dots, n)$. Then the $j$th characteristic
root of $aA_r + bA_s$ is $a\lambda_{rj} + b\lambda_{sj}$ for all $a, b \in \mathfrak{F}$ $(j = 1, 2, \dots, n)$.


Denote the $j$th characteristic root of

$$cA_r + [(a - c)A_r + bA_s], \qquad (a - c)A_r + bA_s, \qquad \text{and} \qquad aA_r + bA_s$$

by

$$c\lambda_{rj} + (a - c)\lambda_{rl} + b\lambda_{sm}, \qquad (a - c)\lambda_{rl} + b\lambda_{sm}, \qquad \text{and} \qquad a\lambda_{rp} + b\lambda_{sq}$$

respectively. It would seem that the subscripts $l$, $m$, $p$, and $q$ depend on the
values of $j$, $a$, $b$, and $c$; that is, $l = l(j, a, b, c)$, $m = m(j, a, b, c)$, $p = p(j, a, b, c)$,
$q = q(j, a, b, c)$, where the functions involved are integral valued and assume
values only between 1 and $n$ inclusive. Since

$$cA_r + [(a - c)A_r + bA_s] = aA_r + bA_s,$$

it is true that

$$c\lambda_{rj} + (a - c)\lambda_{r\,l(j,a,b,c)} + b\lambda_{s\,m(j,a,b,c)} = a\lambda_{r\,p(j,a,b,c)} + b\lambda_{s\,q(j,a,b,c)}. \tag{6.1}$$

Now let $j$, $b$, and $c$ be arbitrary but fixed. Consider the quadruplet of integers
$[l(a), m(a), p(a), q(a)]$. Since $l(a)$, $m(a)$, $p(a)$, and $q(a)$ are integers between 1
and $n$ it follows that at most $n^4$ distinct quadruplets can be obtained by letting
$a$ run over $\mathfrak{F}$. Since $\mathfrak{F}$ is algebraically closed it is an infinite field and hence there
exist an infinite number of distinct $a_i \in \mathfrak{F}$ such that $[l(a_i), m(a_i), p(a_i), q(a_i)]
= [l_0, m_0, p_0, q_0]$ for some fixed $l_0$, $m_0$, $p_0$, and $q_0$. Thus for an infinite number
of distinct $a_i \in \mathfrak{F}$

$$c\lambda_{rj} + (a_i - c)\lambda_{rl_0} + b\lambda_{sm_0} = a_i\lambda_{rp_0} + b\lambda_{sq_0}. \tag{6.2}$$

From (6.2) it follows immediately that

$$\lambda_{rl_0} = \lambda_{rp_0}. \tag{6.3}$$

Furthermore, $\lambda_{r\,l(a)}$ may be taken equal to $\lambda_{rl_0}$ and $\lambda_{r\,p(a)}$ may be taken equal to
$\lambda_{rp_0}$ for all $a \in \mathfrak{F}$. For

$$\det\bigl(cA_r + (x - c)A_r + bA_s - c\lambda_{rj}I - (x - c)\lambda_{rl_0}I - b\lambda_{sm_0}I\bigr)
= \det\bigl(xA_r + bA_s - x\lambda_{rp_0}I - b\lambda_{sq_0}I\bigr) = 0$$

for an infinite number of distinct $x = a_i$. Hence the above determinants are
identically zero, and thus $cA_r + (a - c)A_r + bA_s$ and $aA_r + bA_s$ have
respectively the characteristic roots $c\lambda_{rj} + (a - c)\lambda_{rl_0} + b\lambda_{sm_0}$ and $a\lambda_{rp_0} + b\lambda_{sq_0}$ for all
$a \in \mathfrak{F}$. Consequently one may, without loss of generality, redefine the functions
$l$, $m$, $p$, and $q$ so that $[l(a), m(a), p(a), q(a)] = [l_0, m_0, p_0, q_0]$ for all $a \in \mathfrak{F}$.
From this and the fact that the choice of $j$, $b$, and $c$ was arbitrary it follows from
(6.3) that

$$\lambda_{r\,l(j,a,b,c)} = \lambda_{r\,p(j,a,b,c)} \qquad \text{for all } a, b, c \in \mathfrak{F} \quad (j = 1, 2, \dots, n).$$

Similarly if $j$, $a$, and $c$ are kept fixed it can be shown that

$$\lambda_{s\,m(j,a,b,c)} = \lambda_{s\,q(j,a,b,c)} \qquad \text{for all } a, b, c \in \mathfrak{F} \quad (j = 1, 2, \dots, n).$$

It has been proved that if the $j$th root of $(a - c)A_r + bA_s$ is $(a - c)\lambda_{rj} + b\lambda_{sj}$
then the $j$th root of $aA_r + bA_s$ is $a\lambda_{rj} + b\lambda_{sj}$. But the $j$th root of

$$[a - (a - 1)]A_r + A_s$$

is $\lambda_{rj} + \lambda_{sj}$, and hence the $j$th root of $aA_r + A_s$ is $a\lambda_{rj} + \lambda_{sj}$. Applying
the same process to $[b - (b - 1)]A_s + aA_r$ one obtains the desired result
that the $j$th root of $aA_r + bA_s$ is $a\lambda_{rj} + b\lambda_{sj}$ for all $a, b \in \mathfrak{F}$ $(j = 1, 2, \dots, n)$.


LemMMA 6.2. Suppose that A has property P,. Suppose that the jth charac- 
teristic root of aA,+ bA, is ad,;+ bd,; and that the jth characteristic root of aA, 
+ bA, is ad,; + bd; for alla,b € § (Gf = 1,2,...,m). Then the jth charac- 
teristic root of aA,+ bA,+ cA, is ad,;+ bd,;+ A,,, for all a, b,c iF. 


Note that aA,+bA,+cA, = [aA,+6A,]+cA, = aA,+[bA,+cA,]. Then as 
in Lemma 6.1, @d,;-+ Dda54 CAgij.0.8.c) = Brpim.c.b.c) 1 ONam(j.0.b.<) 1 CAsm(i.0.b. 0° 
Now keep j, a, and 6 fixed and consider the triplet [/(c), m(c), p(m, c)]. 
Proceeding as in Lemma 6.1, one obtains that A,.;;.4.5.-) = Asmj.0.s.. for all 
a,b,c € § (j = 1,2,...,m). Similarly keeping j, a, and c fixed gives the result 
that A,; = Asmcj.c.6.2- From these facts the desired result follows readily. 


THEOREM 6.1. Properties $P_1$ and $P_2$ are equivalent.


Clearly $P_2$ implies $P_1$. The fact that $P_1$ implies $P_2$ follows from a simple
induction on the number of matrices in Lemma 6.2.
















THEOREM 6.2. Property $P_2$ and the Frobenius Property are equivalent.


Obviously the Frobenius Property implies property $P_2$. Suppose that $\mathfrak{A}$ has
property $P_2$. If it is shown that there exists a mapping $\Phi$ which


(a) maps $\mathfrak{A}$ onto an algebra $\mathfrak{B}$ which is semi-simple and has the Frobenius
Property,


(b) preserves characteristic roots, i.e., $A$ and $\Phi(A)$ have the same
characteristic roots, and


(c) satisfies conditions C,

then it will follow from Theorem 5.2 that $\mathfrak{A}$ has the Frobenius Property.
A mapping $\Phi$ satisfying these conditions will now be shown to exist.


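Let $E_i$ $(i = 1, 2, \dots, k)$ be a basis for $\mathfrak{A}$. Let $\rho_{ij}$ $(j = 1, 2, \dots, n)$ denote
the characteristic roots of $E_i$. Define

$$\Phi(E_i) = \begin{bmatrix} \rho_{i1} & 0 & 0 & \cdots & 0 \\ 0 & \rho_{i2} & 0 & \cdots & 0 \\ 0 & 0 & \rho_{i3} & \cdots & 0 \\ \vdots & & & \ddots & \vdots \\ 0 & 0 & 0 & \cdots & \rho_{in} \end{bmatrix}$$

where the $\rho_{ij}$ are so ordered that $\Phi\left(\sum_{i=1}^{k} a_iE_i\right) = \sum_{i=1}^{k} a_i\Phi(E_i)$ for all $a_i \in \mathfrak{F}$. Since
$\mathfrak{A}$ satisfies $P_2$ this is possible. Let $\mathfrak{B}$ be the set of all matrices $\Phi(A)$ with
$A \in \mathfrak{A}$.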
(a) $\mathfrak{B}$ is a semi-simple algebra with the Frobenius Property.


To prove that $\mathfrak{B}$ is an algebra it will be sufficient to show that if $A_r, A_s \in \mathfrak{A}$,
then there exists an $A_t \in \mathfrak{A}$ such that $\Phi(A_t) = \Phi(A_r)\Phi(A_s)$, i.e., that $\mathfrak{B}$ is
closed under multiplication. Now if $A_r, A_s \in \mathfrak{A}$, then since $\mathfrak{A}$ has property
$P_2$ it is true that the $j$th characteristic root of

$$a(A_r + A_s)^2 + b(A_r + A_s) + c(A_r^2 + A_s^2)$$

is $a(\lambda_{rj} + \lambda_{sj})^2 + b(\lambda_{rj} + \lambda_{sj}) + c(\lambda_{rj}^2 + \lambda_{sj}^2)$. Letting $a = \tfrac{1}{2}$, $b = 0$, and
$c = -\tfrac{1}{2}$ one obtains the result that $\Phi\left(\tfrac{1}{2}[A_rA_s + A_sA_r]\right) = \Phi(A_r)\Phi(A_s)$.
Thus $\mathfrak{B}$ is an algebra. Furthermore, since $\mathfrak{B}$ consists of diagonal matrices
only, it is semi-simple and has the Frobenius Property.

(b) $\Phi$, by construction, preserves characteristic roots.


(c) $\Phi$ satisfies conditions C.
(1) $I \in \mathfrak{A}$ and $\Phi(I) = I$, so that $\Phi$ satisfies $C_1$.
(2) $\Phi$, by construction, is linear. Hence $\Phi$ satisfies $C_2$.
(3) It is required to show that if $\prod_{i=1}^{h} A_i = 0$, then $\prod_{i=1}^{h} \Phi(A_i) = 0$; by the
construction of $\Phi$ it is thus required to prove that if $\prod_{i=1}^{h} A_i = 0$, then
$\prod_{i=1}^{h} \lambda_{ij} = 0$ $(j = 1, 2, \dots, n)$, where the $\lambda_{ij}$ are so ordered that $\sum_{i=1}^{h} a_iA_i$ has
characteristic roots $\sum_{i=1}^{h} a_i\lambda_{ij}$. This shall be proved by an induction on $h$.

Let $h = 2$ and suppose that $A_1A_2 = 0$. Consider $A_2A_1 = N$. $N$ is nilpotent;
for $N^2 = 0$. Furthermore, if $f(A_1, A_2) \in \mathfrak{R}[A_1, A_2]$ then since $A_1A_2 = 0$,
$f(A_1, A_2) \cdot [A_2A_1] = \sum a_iA_2^{r_i}A_1$, where $r_i > 0$ for all $i$. But $\left(\sum a_iA_2^{r_i}A_1\right)^2 = 0$.
Therefore $N = A_2A_1$ is in the radical of $\mathfrak{R}[A_1, A_2]$. Then $A_1A_2 - A_2A_1 = -N$ is
in the radical of $\mathfrak{R}[A_1, A_2]$ and by Theorem 3.2, $\lambda_{1j}\lambda_{2j} = 0$ $(j = 1, 2, \dots, n)$,
where the $\lambda_{ij}$ are so ordered that the characteristic roots of $a_1A_1 + a_2A_2$ are
$a_1\lambda_{1j} + a_2\lambda_{2j}$ for all $a_1, a_2 \in \mathfrak{F}$.

Assume now that if $\prod_{i=1}^{h} A_i = 0$, then $\prod_{i=1}^{h} \lambda_{ij} = 0$ $(j = 1, 2, \dots, n)$. Suppose
that $\prod_{i=1}^{h+1} A_i = 0$. Then $(A_1A_2)\prod_{i=3}^{h+1} A_i = 0$. By the induction assumption
$\mu_j \prod_{i=3}^{h+1} \lambda_{ij} = 0$ where $\mu_j$ is the characteristic root of $A_1A_2$ associated with
$\lambda_{ij}$ $(i = 3, 4, \dots, h + 1)$. Suppose that for some $j$, $\prod_{i=3}^{h+1} \lambda_{ij} \neq 0$. Then $\mu_j = 0$.
It must be shown that either $\lambda_{1j}$ or $\lambda_{2j}$ (or both) equals zero.
Consider the matrix

$$B_a = A_1A_2 - \frac{a}{a - 1}\,\lambda_{2j}A_1 - a\lambda_{1j}A_2 + \frac{a^2}{a - 1}\,\lambda_{1j}\lambda_{2j}I
= [A_1 - a\lambda_{1j}I]\left[A_2 - \frac{a}{a - 1}\,\lambda_{2j}I\right], \qquad a \neq 1.$$

Since $\mu_j = 0$ and since $\mathfrak{A}$ has property $P_2$, $B_a$ has for each $a \in \mathfrak{F}$, $a \neq 1$, a
characteristic root equal to zero. Thus for every $a \neq 1$ there exists a vector
$\phi_a \neq 0$ such that $B_a\phi_a = 0$. Thus

$$[A_1 - a\lambda_{1j}I]\left[A_2 - \frac{a}{a - 1}\,\lambda_{2j}I\right]\phi_a = 0.$$

Let $\left[A_2 - \frac{a}{a-1}\lambda_{2j}I\right]\phi_a = \psi_a$. Now clearly if $\psi_a \neq 0$, then
$[A_1 - a\lambda_{1j}I]\psi_a = 0$.

Thus either $\left[A_2 - \frac{a}{a-1}\lambda_{2j}I\right]\phi_a = 0$, $\phi_a \neq 0$ for an infinite number
of distinct $a \in \mathfrak{F}$, or $[A_1 - a\lambda_{1j}I]\psi_a = 0$, $\psi_a \neq 0$ for an infinite number
of distinct $a \in \mathfrak{F}$ (or both). Suppose, say, that $\left[A_2 - \frac{a}{a-1}\lambda_{2j}I\right]\phi_a = 0$,
$\phi_a \neq 0$ for an infinite number of distinct $a \in \mathfrak{F}$. Then $A_2$ has
characteristic roots $\frac{a}{a-1}\lambda_{2j}$ for an infinite number of distinct $a \in \mathfrak{F}$.
But $A_2$ has only a finite number of distinct characteristic roots. Therefore, for
some $a_1, a_2$, $a_1 \neq a_2$, it is true that $\frac{a_1}{a_1 - 1}\lambda_{2j} = \frac{a_2}{a_2 - 1}\lambda_{2j}$. From
this it follows that $\lambda_{2j} = 0$.
By induction it follows that $\Phi$ satisfies $C_3$. Hence the theorem.










As a corollary to the two preceding theorems we have:


THEOREM 6.3. $P_1$, $P_2$, and the Frobenius Property are equivalent.


REFERENCES 


1. A. A. Albert, Structure of Algebras (New York, 1946).

2. H. E. Fettis, A method for obtaining the characteristic equation of a matrix and computing
the associated modal columns, Quarterly J. Appl. Math., vol. 8 (1950), 206-212.

3. J. S. Frame, A simple recursion formula for inverting a matrix, Abstract 471, Bull.
Amer. Math. Soc., vol. 55 (1949), 1045.

4. G. Frobenius, Über vertauschbare Matrizen, Sitz. preuss. Akad. Wiss. (1896), 601-614.

5. C. C. MacDuffee, The theory of matrices, Ergeb. der Math., vol. 2 (1933).

6. C. C. MacDuffee, Vectors and matrices, Carus Mathematical Monograph no. 7 (1943).

7. N. H. McCoy, On the characteristic roots of matric polynomials, Bull. Amer. Math. Soc.,
vol. 42 (1936), 592-600.

8. H. Rademacher, On a theorem of Frobenius, Studies and Essays presented to R. Courant
(New York, 1948), 301-305.

9. J. Williamson, The simultaneous reduction of two matrices to triangular form, Amer. J.
Math., vol. 57 (1935), 281-293.


University of Wisconsin 








CONTRIBUTIONS TO NONCOMMUTATIVE 
IDEAL THEORY 


D. C. MURDOCH 


Introduction. The well-known results of Krull concerning the minimal prime 
divisors and the radical of an ideal in a commutative ring have been extended to 
the noncommutative case in a recent paper [5] by N. H. McCoy. In that paper 
systematic use was made of the concept of an m-system, a set M of elements of
the ring such that if $a \in M$ and $b \in M$ then $axb \in M$ for some element
$x$ of the ring. The m-system plays the same role in the noncommutative
case that the multiplicatively closed system plays in the theory of Krull. 
For example, an ideal in a noncommutative ring is prime if and only if its com- 
plement is an m-system. What follows is an attempt based on the methods of 
McCoy to extend more of the Krull-Noether theory of commutative rings to 
the noncommutative case. Different treatments of the noncommutative case 
have previously been published by Krull [2], and Fitting [1]. Since the point 
of view of the present paper, however, is considerably different from that of 
either of these previous ones, little or no use has been made of their results. The 
results and methods of McCoy [5], on the other hand, have been used extensively. 

The concept of an isolated component ideal (Krull [3] and [4]) leads in the 
noncommutative case to upper and lower right (or left) isolated component 
ideals each of which retains some of the properties of the isolated component 
ideals of the commutative case. These upper and lower components and the 
relations between them are investigated in §§ 2,3 and 4. The results of these 
sections follow without any assumptions of finite chain conditions. The effect 
of descending and ascending chain conditions is considered in §5 and the latter 
is assumed in the remainder of the paper. Right primary ideals are defined in 
a manner which ensures, in the presence of either chain condition, that the 
radical of a right primary ideal is prime. The term radical is used throughout 
in the sense of McCoy [5]. Examples are given which show that not every 
ideal is representable as the intersection of a finite number of right primary ideals 
but any ideal which is so representable has a short representation and for short 
representations the same uniqueness theorems hold as in the commutative case. 
Thus in any two short representations of an ideal a as the intersection of right 
primary ideals the number of primary components is the same and the radicals 
of these coincide in some order. Moreover, the isolated primary components 
are uniquely determined and must occur in any such representation of a. 


1. Definitions and basic concepts. Let R be an arbitrary noncommutative
ring. An ideal $\mathfrak{p}$ in R is prime if $\mathfrak{a}\mathfrak{b} \subseteq \mathfrak{p}$ implies either $\mathfrak{a} \subseteq \mathfrak{p}$ or $\mathfrak{b} \subseteq \mathfrak{p}$, where $\mathfrak{a}$
and $\mathfrak{b}$ are any ideals of R. It has been shown by McCoy [5] that an ideal $\mathfrak{p}$
is prime if and only if, for any elements $a$, $b$ of R, $aRb \subseteq \mathfrak{p}$ implies that either
$a$ or $b$ belongs to $\mathfrak{p}$.

Received September 22, 1950.

DEFINITION 1.1. A set M of elements of R is called an m-system if for any two
elements $a$ and $b$ of M, there exists an element $x$ of R such that $axb \in M$. The null
set is also defined to be an m-system (McCoy [5]).

It is clear from the above remark and from the definition that an ideal is 
prime if and only if its complement in R is an m-system. 

DEFINITION 1.2. An element $a$ of R is said to be right prime to an ideal $\mathfrak{a}$ if
$xRa \subseteq \mathfrak{a}$ implies that $x \in \mathfrak{a}$. An ideal $\mathfrak{b}$ is right prime to $\mathfrak{a}$ if it contains an
element which is right prime to $\mathfrak{a}$.

Elements and ideals left prime to a can be defined in the obvious way but the 
left hand definitions and theorems will usually be omitted. 

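DEFINITION 1.3. If $\mathfrak{a}$ and $\mathfrak{b}$ are ideals in R, the ideal consisting of all elements
$x$ of R such that $xRb \subseteq \mathfrak{a}$ for all $b$ in $\mathfrak{b}$ is called the right ideal quotient of $\mathfrak{a}$ by $\mathfrak{b}$ and
is denoted by $\mathfrak{a}\mathfrak{b}^{-1}$. Similarly $\mathfrak{b}^{-1}\mathfrak{a}$ consists of all $x$ in R such that $bRx \subseteq \mathfrak{a}$ for all
$b$ in $\mathfrak{b}$.

It is obvious that $\mathfrak{a}\mathfrak{b}^{-1}$ and $\mathfrak{b}^{-1}\mathfrak{a}$ always contain $\mathfrak{a}$ and that if $\mathfrak{b}$ is right prime
to $\mathfrak{a}$ then $\mathfrak{a}\mathfrak{b}^{-1} = \mathfrak{a}$.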

DEFINITION 1.4. If M is a non-null m-system, a set N of elements of R is
called a right n-system associated with M (briefly a right M-n-system) if N contains
M and if for every $m$ in M and every $n$ in N there exists an element $x$ of R such that
$nxm \in N$. If M is the null set the only right M-n-system is, by definition, the null
set itself.

We note that every m-system is a right (or left) n-system associated with
itself. Moreover, the set-theoretic union of a finite or infinite number of right
n-systems all of which are associated with the same m-system M, is again a
right n-system associated with M. However it may also be associated with a
larger m-system, properly containing M. As an illustration of this let R be the
ring of integers, $p$ any prime, and M the m-system consisting of all integers
prime to $p$. Let $N_i$ be the set of all integers which are not divisible by $p^i$
$(i = 2, 3, 4, \dots)$. Each $N_i$ is an M-n-system. The union of all $N_i$ is the set
of all non-zero integers and is itself an m-system $M'$. It is therefore an
$M'$-n-system where $M' \supset M$. We remark also that if, in a commutative ring, $\mathfrak{q}$
is a primary ideal and $\mathfrak{p}$ its associated prime, then $M = C(\mathfrak{p})$ (complement of $\mathfrak{p}$
in R) is an m-system and $N = C(\mathfrak{q})$ is an M-n-system.
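The integer illustration can be tested on a finite window. The sketch below is an illustration only, under assumptions chosen here: the prime is $p = 3$, the window is $|x| \le 40$, and the exponents run over $i = 2, 3, 4$, which suffices because $3^4 > 40$. Since the ring of integers is commutative with identity, the element $x = 1$ serves as the witness in every case.

```python
def in_M(x, p):
    """x is prime to p."""
    return x % p != 0


def in_N(x, p, i):
    """x is not divisible by p**i."""
    return x % p ** i != 0


def check_example(p=3, bound=40, max_i=4):
    xs = range(-bound, bound + 1)
    # M is an m-system: for a, b in M the product a*1*b is again in M.
    for a in xs:
        for b in xs:
            if in_M(a, p) and in_M(b, p) and not in_M(a * b, p):
                return False
    for i in range(2, max_i + 1):
        # N_i contains M.
        if any(in_M(x, p) and not in_N(x, p, i) for x in xs):
            return False
        # For n in N_i and m in M, the product n*1*m stays in N_i.
        for n in xs:
            for m in xs:
                if in_N(n, p, i) and in_M(m, p) and not in_N(n * m, p, i):
                    return False
    # Inside the window, the union of the N_i is the set of non-zero integers.
    union = {x for x in xs for i in range(2, max_i + 1) if in_N(x, p, i)}
    return union == {x for x in xs if x != 0}


print(check_example())
```

A finite check of this kind only samples the statement; the general argument is the one given in the text.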


2. Upper isolated component ideals. 


DEFINITION 2.1. If $\mathfrak{a}$ is any ideal in R and M is an m-system which does not
meet $\mathfrak{a}$ (i.e. has no elements in common with $\mathfrak{a}$), the right upper M-component of $\mathfrak{a}$
is defined to be the set of all elements $x$ of R having the property that every right
M-n-system which contains $x$ meets $\mathfrak{a}$.

The right upper M-component of $\mathfrak{a}$ will be denoted by $u(\mathfrak{a}, M)$. In order to
show that $u(\mathfrak{a}, M)$ is an ideal we shall require the following lemmas.





NONCOMMUTATIVE IDEAL THEORY 45 


LEMMA 1. If $\mathfrak{a}$ is any ideal and M an m-system which does not meet $\mathfrak{a}$ then there
exists a maximal right M-n-system N which does not meet $\mathfrak{a}$ and N is uniquely
determined by M and $\mathfrak{a}$.


Proof. There exists at least one right M-n-system which does not meet a, 
namely M itself. The union N of all such right M-n-systems therefore satisfies 
the requirements of the lemma. 


LEMMA 2. Let M be any m-system and N any right M-n-system. Let $\mathfrak{a}$ be an
ideal which does not meet N. Then $\mathfrak{a}$ is contained in a maximal ideal $\mathfrak{q}^*$ which
does not meet N and $\mathfrak{q}^*$ has the property that if $aRb \subseteq \mathfrak{q}^*$ and $b \in M$, then $a \in \mathfrak{q}^*$.


Proof. Since the union of any linearly ordered set of ideals which do not
meet N is an ideal which does not meet N, the existence of $\mathfrak{q}^*$ follows from Zorn's
Lemma [6, p. 101].

Now suppose $a$ is an element of R which does not belong to $\mathfrak{q}^*$. Then $(a, \mathfrak{q}^*)$
properly contains $\mathfrak{q}^*$ and hence by the maximal property of $\mathfrak{q}^*$ must contain an
element $n$ of N. Thus

$$n = ia + ra + ar' + \sum_i r_i a r_i' + q$$

where $i$ is an integer, $r$, $r'$, $r_i$, $r_i'$ are elements of R and $q \in \mathfrak{q}^*$. Now if $b \in M$
there exists an element $x$ of R such that $nxb \in N$ where

$$nxb = iaxb + raxb + ar'xb + \sum_i r_i a r_i' xb + qxb.$$

But if $aRb \subseteq \mathfrak{q}^*$ every element in the sum on the right hand side of this equation
belongs to $\mathfrak{q}^*$, and $nxb$ belongs to both N and $\mathfrak{q}^*$, contrary to the definition of $\mathfrak{q}^*$.
Hence if $aRb \subseteq \mathfrak{q}^*$ and $b \in M$ we must have $a \in \mathfrak{q}^*$, as required.

Lemma 2 states that every element of M is right prime to q*. It will be con- 
venient to refer to this property by saying that q* has property (A) relative 
to M. We remark also that if an ideal has property (A) relative to M then its 
complement in R is a right M-n-system and conversely. 


LEMMA 3. Let $\mathfrak{a}$ be an ideal and M an m-system which does not meet $\mathfrak{a}$. A set
$\mathfrak{q}$ of elements of R is a minimal ideal containing $\mathfrak{a}$ and having property (A) relative
to M if and only if $C(\mathfrak{q})$ is a maximal right M-n-system which does not meet $\mathfrak{a}$.


Proof. (i) First suppose $C(\mathfrak{q})$ is a maximal right M-n-system which does not
meet $\mathfrak{a}$. By Lemma 2, $\mathfrak{a}$ is contained in a maximal ideal $\mathfrak{q}^*$ which does not meet
$C(\mathfrak{q})$. Moreover, $\mathfrak{q}^*$ has property (A) relative to M and hence $C(\mathfrak{q}^*)$ is a right
M-n-system which does not meet $\mathfrak{a}$. Since $\mathfrak{q}^*$ does not meet $C(\mathfrak{q})$ we have
$C(\mathfrak{q}) \subseteq C(\mathfrak{q}^*)$ and hence, from the maximal property of $C(\mathfrak{q})$, it follows that
$C(\mathfrak{q}) = C(\mathfrak{q}^*)$ and $\mathfrak{q} = \mathfrak{q}^*$. Thus $\mathfrak{q}$ is an ideal with property (A) relative to M.
Finally $\mathfrak{q}$ is a minimal such ideal, for if $\mathfrak{q} \supset \mathfrak{q}' \supseteq \mathfrak{a}$ where $\mathfrak{q}'$ has property (A)
relative to M then $C(\mathfrak{q}')$ is a right M-n-system which does not meet $\mathfrak{a}$ and
properly contains $C(\mathfrak{q})$, contrary to the maximal property of $C(\mathfrak{q})$.













(ii) Conversely, suppose $\mathfrak{q}$ is a minimal ideal containing $\mathfrak{a}$ and having property
(A) relative to M. Then $C(\mathfrak{q})$ is a right M-n-system which does not meet $\mathfrak{a}$,
and by Lemma 1 is contained in a maximal such right M-n-system N. Hence
by (i) proved above $C(N)$ is a minimal ideal containing $\mathfrak{a}$ and having property
(A) relative to M and since $C(\mathfrak{q}) \subseteq N$, $\mathfrak{q} \supseteq C(N)$ and by the minimal property
of $\mathfrak{q}$, $\mathfrak{q} = C(N)$ whence $C(\mathfrak{q})$ is a maximal right M-n-system which does not meet
$\mathfrak{a}$. This completes the proof.


THEOREM 1. The right upper isolated M-component $u(\mathfrak{a}, M)$ of $\mathfrak{a}$ is an ideal.
Its complement in R is the uniquely determined maximal right M-n-system which
does not meet $\mathfrak{a}$, and $u(\mathfrak{a}, M)$ itself is the crosscut of all ideals containing $\mathfrak{a}$ which
have property (A) relative to M.


Proof. Let M be an m-system which does not meet $\mathfrak{a}$ and let N be the maximal
right M-n-system not meeting $\mathfrak{a}$ whose existence is assured by Lemma 1. By
Lemma 3, $\mathfrak{q} = C(N)$ is a minimal ideal containing $\mathfrak{a}$ and having property (A)
relative to M. Since the crosscut of any set of ideals containing $\mathfrak{a}$ and having
property (A) again has property (A) it follows that there is a unique minimal
such ideal which must be equal to $\mathfrak{q}$ and hence $\mathfrak{q}$ is the crosscut of all ideals
containing $\mathfrak{a}$ which have property (A) relative to M. It remains to prove that
$\mathfrak{q} = u(\mathfrak{a}, M)$.

First, $\mathfrak{q} \subseteq u(\mathfrak{a}, M)$. For if $x \in \mathfrak{q}$ then $x$ does not belong to N, the maximal
right M-n-system which does not meet $\mathfrak{a}$. Hence every M-n-system which
contains $x$ meets $\mathfrak{a}$ and $x \in u(\mathfrak{a}, M)$. On the other hand, $u(\mathfrak{a}, M) \subseteq \mathfrak{q}$. For
if $x \in u(\mathfrak{a}, M)$ then $x$ cannot belong to N and must belong to $\mathfrak{q}$. Hence
$u(\mathfrak{a}, M) = \mathfrak{q}$ and the theorem is proved.


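COROLLARY 1. If $\mathfrak{a} \supseteq \mathfrak{b}$ and M is an m-system which does not meet $\mathfrak{a}$ then
$u(\mathfrak{a}, M) \supseteq u(\mathfrak{b}, M)$.


COROLLARY 2. If $M_1$, $M_2$ are m-systems which do not meet $\mathfrak{a}$ and if $M_1 \supseteq M_2$
then $u(\mathfrak{a}, M_1) \supseteq u(\mathfrak{a}, M_2)$.


Proof. Every $M_1$-n-system is also an $M_2$-n-system and hence the maximal
$M_1$-n-system which does not meet $\mathfrak{a}$ is contained in the maximal such
$M_2$-n-system. Taking complements,

$$u(\mathfrak{a}, M_1) \supseteq u(\mathfrak{a}, M_2).$$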


If p is a prime ideal which divides a and if M = C(p), then u(a, M) will also 
be referred to as the right upper p-component of a and will also be denoted, when 
convenient, by u(a, p). 


3. Lower isolated component ideals. In this section we shall define a right 
lower isolated component of an ideal a, and we shall investigate its relationship 
to the upper isolated component discussed in the previous section. 

DEFINITION 3.1. If a is any ideal in R and M any m-system which does not
meet a, the right lower isolated component of a corresponding to M, or briefly the
right lower M-component of a, is defined to be the set of all elements x of R such
that xRm ⊆ a for some element m of M.

The right lower M-component of a will be denoted by l(a, M). It is clear
that l(a, M) is an ideal, for if x ∈ l(a, M) certainly -x, rx and xr belong
to l(a, M) for all r in R. Also if x_1 R m_1 ⊆ a and x_2 R m_2 ⊆ a where m_1, m_2
belong to M, then m_1 r m_2 ∈ M for some r in R and

(x_1 + x_2) R m_1 r m_2 ⊆ x_1 R m_1 r m_2 + x_2 R m_2 ⊆ a.

Hence x_1 + x_2 ∈ l(a, M).
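
Definition 3.1 can be tried out by brute force in a small noncommutative ring. The sketch below is only an illustration: the ring (upper triangular 2 x 2 matrices over the field of two elements), the encoding of the matrix [[a, b], [0, c]] as the tuple (a, b, c), and the names `mul` and `lower_component` are assumptions made for this example and do not come from the paper.

```python
from itertools import product

# Hypothetical example ring: upper triangular 2x2 matrices over GF(2).
# The tuple (a, b, c) stands for the matrix [[a, b], [0, c]].
R = list(product((0, 1), repeat=3))

def mul(x, y):
    """Product of two upper triangular matrices, entries reduced mod 2."""
    a, b, c = x
    d, e, f = y
    return (a * d % 2, (a * e + b * f) % 2, c * f % 2)

def lower_component(ideal, M):
    """All x in R such that x*r*m lies in `ideal` for every r in R, for some m in M."""
    return {
        x for x in R
        if any(all(mul(mul(x, r), m) in ideal for r in R) for m in M)
    }

zero = {(0, 0, 0)}                   # the zero ideal, playing the role of a
M1 = [m for m in R if m[0] == 1]     # complement of the prime ideal {a = 0}
M2 = [m for m in R if m[2] == 1]     # complement of the prime ideal {c = 0}

l1 = lower_component(zero, M1)
l2 = lower_component(zero, M2)
print(sorted(l1))
print(sorted(l2))
```

In this toy ring both sets M1 and M2 are multiplicatively closed, hence m-systems, and the two lower components of the zero ideal come out different: four elements for M1, and the zero ideal alone for M2.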


THEOREM 2. If a is an ideal and M an m-system which does not meet a then
u(a, M) ⊇ l(a, M) ⊇ a.


Proof. If x ∈ l(a, M) then xRm ⊆ a for some element m of M. Hence every
right M-n-system which contains x meets a and x ∈ u(a, M). That l(a, M) ⊇ a
is obvious from the definition.


THEOREM 3. (a) u[u(a, M), M] = u(a, M),
(b) l[u(a, M), M] = u(a, M),
(c) u[l(a, M), M] = u(a, M).


Proof. (a) The complement in R of u(a, M) is a right M-n-system N and
hence is certainly the maximal such that does not meet u(a, M). Hence by
Theorem 1, C(N) = u(a, M) is the right upper M-component of u(a, M).

(b) The ideal l[u(a, M), M] consists of all elements x of R such that xRm
⊆ u(a, M) for some m in M. But since u(a, M) has property (A) relative to M
this implies that x ∈ u(a, M). Hence l[u(a, M), M] ⊆ u(a, M). Since, by
Theorem 2, u(a, M) ⊆ l[u(a, M), M], the equality follows.

(c) If x ∈ u[l(a, M), M] then every right M-n-system N which contains x
meets l(a, M), that is, N contains an element n such that nRm ⊆ a for some m
in M. But since n ∈ N and m ∈ M, nrm ∈ N for some r in R. Hence N meets a
and x ∈ u(a, M) and we have u[l(a, M), M] ⊆ u(a, M). But since a ⊆ l(a, M),
by Corollary 1, Theorem 1, u(a, M) ⊆ u[l(a, M), M] and the equality follows.


DEFINITION 3.2. For all ordinal numbers α we define the ideal l^α(a, M) by
induction as follows: l^1(a, M) = l(a, M). If α is not a limit ordinal, l^α(a, M)
= l[l^{α-1}(a, M), M], while if α is a limit ordinal, l^α(a, M) is the union of all
l^σ(a, M) for which σ < α.


It is clear that l^α(a, M) ⊇ l^σ(a, M) if σ < α.

THEOREM 4. For all ordinal numbers α, u(a, M) ⊇ l^α(a, M).


Proof. By Theorem 2 the result is known for α = 1. We assume the theorem
for all ordinals less than α and proceed by induction.


Case 1. If α is not a limit ordinal and so has an immediate predecessor
α - 1 we have

l^α(a, M) = l[l^{α-1}(a, M), M]
          ⊆ u[l^{α-1}(a, M), M]    by Theorem 2,
          ⊆ u[u(a, M), M]          by Corollary 1, Theorem 1,
          = u(a, M)                by Theorem 3(a).

Case 2. If α is a limit ordinal, l^α(a, M) is the union of all l^σ(a, M) for σ < α.
Hence if x ∈ l^α(a, M) then x ∈ l^σ(a, M) for some σ < α and x ∈ u(a, M) by the
induction assumption, and hence l^α(a, M) ⊆ u(a, M).


THEOREM 5. For any ordinal number α, l^α(a, M) = l^{α+1}(a, M) if and only if
l^α(a, M) = u(a, M).


Proof. (i) If l^α(a, M) = u(a, M) then l^{α+1}(a, M) = l[u(a, M), M] = u(a, M)
by Theorem 3(b).

(ii) If l^α(a, M) = l^{α+1}(a, M), let x be any element of l^{α+1}(a, M) so that
xRm ⊆ l^α(a, M) for some element m of M. But under the assumption l^α(a, M)
= l^{α+1}(a, M) the condition xRm ⊆ l^α(a, M) implies x ∈ l^α(a, M). Hence
l^α(a, M) has property (A) relative to M and since u(a, M) is the minimal ideal
having this property we have l^α(a, M) = u(a, M).


COROLLARY 1. There exists an ordinal number α, finite or transfinite, such that

l^α(a, M) = u(a, M).


Since the l^σ(a, M) are well ordered and the union of every subset of them is
again an l^σ(a, M), by Zorn's lemma they are all contained in a maximal one,
l^α(a, M). Necessarily l^α(a, M) = l^{α+1}(a, M) = u(a, M) by the theorem.


COROLLARY 2. If the ascending chain condition holds in the residue class ring
R/a then l^n(a, M) = u(a, M) for some finite n.


COROLLARY 3. If the ascending chain condition holds in R/a and if x ∈ u(a, M)
then for every element r of R there exists an element m_r of M such that x r m_r ∈ a.
The element m_r is independent of r if and only if l(a, M) = u(a, M).


Proof. By Corollary 2, l^n(a, M) = u(a, M) for some finite n. Hence, if
x ∈ u(a, M) there exists an element m of M such that xRm ⊆ l^{n-1}(a, M). That
is, for every r_1 in R there is an element m(r_1) of M such that

x r_1 m R m(r_1) ⊆ l^{n-2}(a, M).

Hence for every r_2 in R there is an element m(r_2) such that

x r_1 m r_2 m(r_1) R m(r_2) ⊆ l^{n-3}(a, M).

Carrying on in this way we find

x r_1 m r_2 m(r_1) r_3 m(r_2) ... r_{n-1} m(r_{n-2}) R m(r_{n-1}) ⊆ a.

Now r_2 can be chosen so that m r_2 m(r_1) ∈ M; r_3 so that m r_2 m(r_1) r_3 m(r_2) ∈ M
and so on. Finally, choose r_n so that m r_2 m(r_1) r_3 m(r_2) ... m(r_{n-2}) r_n m(r_{n-1}) ∈ M
and the result follows.

Finally, if for all x in u(a, M), xrm ∈ a where m is independent of r then
xRm ⊆ a and x ∈ l(a, M) and u(a, M) = l(a, M). Conversely, if u(a, M)
= l(a, M) then there exists an m independent of r and the proof of the corollary
is complete.


4. The commutative case. We shall now investigate the relationship of
l(a, M) and u(a, M) to the isolated component ideals defined by Krull [4, p. 16]
in a commutative ring.


THEOREM 6. If a is an ideal in a commutative ring R, and M an m-system
which does not meet a, the set a(M) of all elements x of R for which xm ∈ a for
some element m of M, is an ideal.


Proof. If x m_1 ∈ a and y m_2 ∈ a where m_1 and m_2 are elements of M, then if
r is chosen so that m_1 r m_2 ∈ M we have

(x - y) m_1 r m_2 = x m_1 r m_2 - y m_2 m_1 r ∈ a,

and therefore x - y ∈ a(M). Since obviously cx ∈ a(M) for all c in R, a(M)
is an ideal.


DEFINITION 4.1. The ideal a(M) defined in Theorem 6 is called the isolated
M-component of a.

The isolated component ideal of Krull was defined exactly as in Definition 4.1
except that M was restricted to be a multiplicatively closed system. Since
every multiplicatively closed system is an m-system [5] our definition of a(M)
coincides with that of Krull whenever the latter applies, that is, whenever M
is multiplicatively closed. That u(a, M) and l(a, M) may both be considered
as generalizations of a(M) to the noncommutative case may now be seen from
the following result.


THEOREM 7. If a is any ideal in a commutative ring R, and M is an m-system
which does not meet a, then u(a, M) = l(a, M) = a(M).


Proof. If x ∈ a(M) then xm ∈ a for some element m of M. Hence, since R
is commutative, xRm ⊆ a. Therefore x ∈ l(a, M), and a(M) ⊆ l(a, M).

Now if x ∈ u(a, M), every M-n-system which contains x meets a. But the
set of elements N = {x, M, xm} containing x, M, and all elements xm where
m ∈ M, is an M-n-system containing x. Hence N meets a and since M does
not meet a it follows that xm ∈ a for some element m of M. Therefore x ∈ a(M)
and we have now u(a, M) ⊆ a(M) ⊆ l(a, M). But by Theorem 2, l(a, M)
⊆ u(a, M) and the theorem follows.
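
In the commutative case the common value a(M) of Theorem 7 is easy to compute directly. The following is a minimal sketch, assuming the ring Z/nZ, the zero ideal in the role of a, and M the set of residues not divisible by a prime p dividing n; these choices and the name `isolated_component` are illustrative only.

```python
def isolated_component(n, p):
    """a(M) for a = (0) in Z/nZ, with M the residues not divisible by p.

    Assumes p is a prime dividing n, so that M is multiplicatively closed
    (hence an m-system) and does not meet the zero ideal.
    """
    M = [m for m in range(n) if m % p != 0]
    return [x for x in range(n) if any((x * m) % n == 0 for m in M)]

print(isolated_component(12, 2))   # the multiples of 4 in Z/12Z
print(isolated_component(12, 3))   # the multiples of 3 in Z/12Z
```

For n = 12 the two components are the ideals generated by 4 and by 3, and they intersect in the zero ideal, which mirrors the factorization 12 = 4 · 3.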


5. Chain conditions. For most of what follows it will be necessary to assume 
that the ring R satisfies the ascending chain condition for two sided ideals. 
Before imposing this restriction, however, we shall develop some consequences 
of the following weak form of the descending chain condition. 










CONDITION A. For every ideal a which is not prime, the ring R/a satisfies the 
descending chain condition for two sided ideals. 


THEOREM 8. If 𝔰 is a minimal proper divisor of a then 𝔰^{-1}a is a prime divisor
of a and is not right prime to a.


Proof. If 𝔰^{-1}a = R, it is prime, and since 𝔰R(𝔰^{-1}a) ⊆ a and 𝔰 is a proper
divisor of a it follows that 𝔰^{-1}a is nrp to a. ("nrp" means "not right prime.")

If 𝔰^{-1}a ≠ R, suppose xRy ⊆ 𝔰^{-1}a where x is not in 𝔰^{-1}a. Then for every
element s of 𝔰, sRxRy ⊆ a, but for some element s' of 𝔰, s'Rx ⊄ a. Choose
r in R so that s'rx is not in a and form the ideal (s'rx, a) which properly contains
a but is contained in 𝔰. From the minimal property of 𝔰 we have therefore
𝔰 = (s'rx, a) and every element s of 𝔰 has the form

s = a + i s'rx + r_1 s'rx + s'rx r_2 + Σ r_j s'rx r_j',

where the element a belongs to the ideal a, i is an integer and r_1, r_2, r_j, r_j' are
elements of R. Since s'RxRy ⊆ a it is clear from the form of the above expression
for s that sRy ⊆ a and hence y ∈ 𝔰^{-1}a. Thus if x is not in 𝔰^{-1}a and
xRy ⊆ 𝔰^{-1}a then y ∈ 𝔰^{-1}a and hence 𝔰^{-1}a is prime. Since 𝔰 contains an
element s not in a and since sR(𝔰^{-1}a) ⊆ a it is clear that 𝔰^{-1}a is nrp to a.


COROLLARY. If condition A holds in R then every ideal a ≠ R has a minimal
prime divisor which is not right prime to a.


For condition A ensures the existence of a minimal ideal containing a and hence
a prime p which is nrp to a. This prime must contain a minimal prime divisor
of a which will also be nrp to a.


THEOREM 9. If the ascending chain condition holds for two sided ideals in R
then every ideal c in R has at most a finite number of minimal prime divisors [2].


Proof. If c is a prime ideal the theorem is obvious. If c is not prime there
exist elements a_1 and b_1 of R which do not belong to c such that a_1 R b_1 ⊆ c.
Hence if c is contained in an infinite number of minimal primes p_i, either a_1 or b_1
must belong to an infinite number of these. Suppose it is the element a_1 and let
the ideal a_1 = (a_1, c) be generated by this element and c. Then the ideal a_1 is a
proper divisor of c, p_i ⊇ a_1 for an infinite number of primes p_i, and
each of these p_i is a minimal prime divisor of a_1. Hence a_1 cannot be prime.
Therefore if c has an infinite number of minimal prime divisors it has a proper
divisor with the same property and continuation of this argument leads to a
contradiction of the ascending chain condition in R.


THEOREM 10. If the ascending chain condition holds for two sided ideals in R
and p_1, p_2, ..., p_m are the minimal prime divisors of an ideal c then

p_{i_1} R p_{i_2} R ... R p_{i_n} ⊆ c

where i_1, i_2, ..., i_n is some finite permutation of the integers 1, 2, ..., m with
repetitions allowed.


Proof. The theorem is trivially true if c is prime. If c is not prime, there
exist elements a and b not in c such that aRb ⊆ c ⊆ p_i (i = 1, 2, ..., m). Hence
for each i either a ∈ p_i or b ∈ p_i. Form the ideals a_1 = (a, c) and b_1 = (b, c)
both of which are proper divisors of c. Let p'_1, p'_2, ..., p'_r be the minimal
prime divisors of a_1 and p''_1, p''_2, ..., p''_t be those of b_1. Now suppose that
both a_1 and b_1 have the property that we wish to prove of c, so that
p'_{i_1} R p'_{i_2} R ... R p'_{i_k} ⊆ a_1 and p''_{j_1} R p''_{j_2} R ... R p''_{j_l} ⊆ b_1
and hence, since a_1 R b_1 ⊆ c,

p'_{i_1} R ... R p'_{i_k} R p''_{j_1} R ... R p''_{j_l} ⊆ c.

Now each p'_i and p''_j, being a prime divisor of c, contains a minimal prime of c,
and hence p_{s_1} R p_{s_2} R ... R p_{s_n} ⊆ c where p_{s_1}, ..., p_{s_n} are minimal
primes of c. Hence if the theorem is false for c it is false for a proper divisor of c,
and a continuation of this argument leads to a contradiction of the ascending
chain condition in R.


COROLLARY. If the ascending chain condition holds for two sided ideals in R
then every ideal a ≠ R has a minimal prime divisor which is not right prime to a.


Proof. If a is prime but ≠ R then a itself is the required minimal prime. If
a is not prime, by Theorem 10 we have

(1) p_1 R p_2 R ... R p_s ⊆ a

where p_1, p_2, ..., p_s are (not necessarily distinct) minimal primes of a and s > 1.
Hence there exists a shortest product of the form (1) which belongs to a; that is,
there exists an s > 1 such that (1) holds but

p_1 R p_2 R ... R p_{s-1} ⊄ a.

It follows that p_s is nrp to a.


6. Primary ideals. In this section we shall require the results of [5] concerning 
the radical of an ideal. The radical r(a) of an ideal a is defined as the set of all 
elements x of R such that every m-system containing x meets a. McCoy has 
shown that r(a) is an ideal and is equal to the intersection of all minimal prime 
divisors of a. 
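
In a commutative ring this radical coincides with the classical one, the set of elements some power of which lies in a. The sketch below, offered only as an illustration, computes it for the zero ideal of Z/nZ; the ring and the function name `radical` are assumptions for this example, and the m-system definition itself is not exercised.

```python
def radical(n):
    """Radical of the zero ideal of Z/nZ: residues x with x**k == 0 (mod n) for some k."""
    # A nilpotent residue mod n already vanishes at some exponent k <= n,
    # so testing k = 1, ..., n is enough.
    return [x for x in range(n)
            if any(pow(x, k, n) == 0 for k in range(1, n + 1))]

print(radical(12))   # multiples of 6, the intersection of (2) and (3)
print(radical(8))    # multiples of 2
```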


DEFINITION 6.1. An ideal q is said to be right primary if all elements not in
r(q) are right prime to q.

Thus q is right primary if the conditions aRb ⊆ q and b ∉ r(q) together imply
a ∈ q.


THEOREM 11. If either Condition A or the ascending chain condition holds in R
then the radical of a right primary ideal is prime.


Proof. Suppose q is right primary. By the corollaries to Theorems 8 and
10, if q ≠ R it has a minimal prime divisor p which is nrp to q. Hence for every
element p of the ideal p we have xRp ⊆ q for some x not in q. Since q is right
primary this implies that the element p belongs to r(q), and hence the ideal p is
contained in r(q). But since r(q) is the intersection of the minimal primes of q
we have r(q) ⊆ p and the theorem follows. If q = R then r(q) = R and the
theorem holds in this case too.

In rings which satisfy no finite chain conditions it seems possible that right
primary ideals may exist whose radical is not prime. Such an ideal q, if it
exists, must be such that all its minimal prime divisors are right prime to q and
no product of the form p_1 R p_2 R ... R p_s, where p_1, p_2, ..., p_s are (not
necessarily distinct) minimal prime divisors of q, can belong to q. Since, in a
commutative ring, every minimal prime divisor of q is nrp to q [6, p. 112] our
definition of a (right) primary ideal implies a prime radical in the commutative
case even without chain conditions. In fact, in a commutative ring it reduces to
the usual definition of a primary ideal by virtue of [6, p. 182, Theorem 59].


7. Ideals expressible as the intersection of right primary ideals. In this
section we shall consider ideals which can be represented as the intersection of
a finite number of right primary ideals and shall find what characteristics of
such a representation are uniquely determined by the ideal in question. It will
be assumed throughout the remainder of the paper that the ascending chain
condition holds for the two sided ideals of R. A representation

(2) a = q_1 ∩ q_2 ∩ ... ∩ q_n

of an ideal a as the intersection of right primary ideals q_1, ..., q_n will be called
an irredundant representation if no one of the ideals q_i contains the intersection
of the remaining ones.


THEOREM 12. If (2) is an irredundant representation of an ideal a as the
intersection of right primary ideals q_1, ..., q_n, then an element x is right prime to
a if and only if x ∈ C(p_i) for i = 1, 2, ..., n, where p_i is the radical of q_i.


Proof. (i) If an element a is nrp to the ideal a then for some element x which
is not in the ideal a, xRa is contained in the ideal a. But this implies xRa ⊆ q_i
for i = 1, 2, ..., n while x ∉ q_j for at least one value of j. Hence, since q_j is
right primary, a ∈ p_j. It follows that if the element a belongs to C(p_i) for all i
then it is right prime to the ideal a.

(ii) Conversely, suppose that a is an element of at least one of the primes p_i
and let it be p_1. By Theorem 10 some product of the form

aRaR ... aRa

is contained in q_1. Since the representation a = q_1 ∩ q_2 ∩ ... ∩ q_n is
irredundant we can choose an element b which is contained in q_2 ∩ q_3 ∩ ... ∩ q_n
but not in q_1. Then the product

bRaR ... aRa

is contained in the ideal a. Suppose the shortest such product which is contained
in the ideal a has s factors a. Then s ≥ 1 since b does not belong to the ideal a.
If s = 1 then bRa is contained in the ideal a and therefore the element a is nrp to
it. If s > 1 then the product bRaR ... aRa, with s - 1 factors a, contains an
element b' which does not belong to the ideal a, while b'Ra is contained in it, and
again the element a is nrp to the ideal a. This completes the proof.


THEOREM 13. The intersection of any finite number of right primary ideals all
of which have the same radical p is a right primary ideal with radical p.


Proof. Let q_1, q_2, ..., q_n be right primary ideals all having radical p and let
q be their intersection. Since p is the only minimal prime divisor of q_i we have
by Theorem 10 that pRpR ... pRp is contained in each q_i and hence pRpR ...
pRp ⊆ q. Therefore, if p_1 is any prime divisor of q we have p_1 ⊇ pRpR ... pRp
and hence p_1 ⊇ p by the definition of a prime ideal. It follows that p is a unique
minimal prime divisor of q and therefore p = r(q). Moreover, if aRb ⊆ q and
a ∉ q then aRb ⊆ q_i for each i while a ∉ q_j for at least one j. Since q_j is right
primary with radical p this implies that b ∈ p = r(q) whence q is right primary
with radical p.


THEOREM 14. An irredundant intersection of a finite number of right primary
ideals not all of which have the same radical is not a right primary ideal.


Proof. Let q be an irredundant intersection of right primary ideals q_1, q_2,
..., q_n whose radicals are p_1, p_2, ..., p_n. If q is right primary it has a unique
minimal prime divisor p and all elements not in p are right prime to q. But by
Theorem 12, if x is right prime to q, x is not contained in any of the primes
p_1, p_2, ..., p_n. Hence C(p) ⊆ C(p_i) and p ⊇ p_i for i = 1, 2, ..., n. Since p is
a minimal prime divisor of q it follows that each p_i is equal to p. Hence if
p_1, p_2, ..., p_n are not all equal q is not right primary.


DEFINITION 7.1. An irredundant representation (2) of a will be called a
short representation if none of the ideals obtained by taking the intersection of two
or more of the ideals q_1, q_2, ..., q_n are right primary.

In view of Theorems 13 and 14 the irredundant representation (2) is a short
representation of a if and only if no two of the radicals of q_1, q_2, ..., q_n are equal.


THEOREM 15. Let a = q_1 ∩ q_2 ∩ ... ∩ q_n be an irredundant representation of
a as the intersection of right primary ideals, and let p_i be the radical of q_i. If p is
a prime ideal not equal to R which contains p_1, p_2, ..., p_r but does not contain
p_{r+1}, ..., p_n, then u(a, p) = q_1 ∩ q_2 ∩ ... ∩ q_r.

Proof. If p ⊇ p_i then by Theorem 1, Corollary 2,

u(a, p) ⊆ u(a, p_i).

But since q_i has property (A) relative to the m-system C(p_i), Theorem 1 shows
that u(a, p_i) ⊆ q_i. Hence

(3) u(a, p) ⊆ q_1 ∩ q_2 ∩ ... ∩ q_r.

Now if r = n, (3) gives u(a, p) ⊆ a and since u(a, p) ⊇ a we have u(a, p) = a
= q_1 ∩ q_2 ∩ ... ∩ q_n and the result is proved in this case. If r < n, since p
does not contain p_i for i > r, it follows that p does not contain q_i either; for since
p_i is the only minimal prime of q_i it is contained in every prime which contains
q_i. Hence there exist elements m_1, m_2, ..., m_{n-r} such that m_i ∈ q_{r+i} but
m_i ∉ p (i = 1, 2, ..., n - r). Now since m_1, m_2, ..., m_{n-r} all belong to the
m-system M = C(p), there exist elements x_1, x_2, ..., x_{n-r-1} such that the
element m = m_1 x_1 m_2 x_2 m_3 ... x_{n-r-1} m_{n-r} is contained in M. Also it is
clear that m ∈ q_{r+1} ∩ q_{r+2} ∩ ... ∩ q_n. Hence if an element q belongs to
q_1 ∩ q_2 ∩ ... ∩ q_r we have qRm ⊆ a where m ∈ M and therefore every right
M-n-system which contains q meets a. Hence q ∈ u(a, p) and

q_1 ∩ q_2 ∩ ... ∩ q_r ⊆ u(a, p),

which with (3) gives the result stated in the theorem.


THEOREM 16. If (2) is an irredundant representation of a as the intersection
of right primary ideals q_1, q_2, ..., q_n with radicals p_1, p_2, ..., p_n, then the
minimal prime divisors of a are exactly those primes which are minimal in the set
p_1, p_2, ..., p_n.


Proof. For each i some product p_i R p_i ... R p_i is contained in q_i. Taking
products over i = 1, 2, ..., n,

p_{i_1} R p_{i_2} ... R p_{i_s} ⊆ a

where each p_{i_j} is one of the primes p_1, p_2, ..., p_n. Hence every prime which
contains a contains the above product and therefore must contain one of the primes
p_1, p_2, ..., p_n. Hence every minimal prime containing a is a minimal prime of
this set and conversely.


THEOREM 17. If (2) is a short representation of a as the intersection of right
primary ideals q_1, q_2, ..., q_n, and if¹ p ≠ R is any minimal prime divisor of a,
then u(a, p) is right primary and equal to one of the q_i.


Proof. Since p is a minimal prime divisor of a, by Theorem 16 it is the radical
of one of the ideals q_i, say q_j. Since p is minimal it cannot contain the
radical of any ideal q_i for i ≠ j. Hence Theorem 15 gives u(a, p) = q_j.


COROLLARY 1. If p_1, p_2, ..., p_m (all different from R) are the minimal prime
divisors of a then in any short representation of a as the intersection of a finite number
of right primary ideals, u(a, p_1), ..., u(a, p_m) must occur among the right primary
components.


COROLLARY 2. A necessary condition that an ideal a be representable as the
intersection of a finite number of right primary ideals is that u(a, p_i) be right primary
for all minimal prime divisors p_i of a.


DEFINITION 7.2. If a is representable as the intersection of right primary ideals
then the upper component ideals u(a, p) corresponding to the minimal prime divisors
of a are called the isolated right primary components of a.





¹The restriction p ≠ R excludes only the case in which a is itself primary with radical R.







Thus the isolated right primary components of a are right primary ideals
which occur as components in every short representation of a as the intersection
of right primary ideals.

It is now easy to give examples of rings satisfying the ascending chain
condition in which not all ideals are expressible as the intersection of a finite number
of right primary ideals. Let R be the ring of all polynomials in two noncommutative
indeterminates x and y with coefficients in a field K. Let a be the ideal
(xy) which has two minimal prime divisors p_1 = (x) and p_2 = (y), and is clearly
not right primary. The radical of a is p_1 ∩ p_2 or (xy, yx). Now if aRb ⊆ (xy)
and b ∉ (y) then a ∈ (xy). Hence (xy) has property (A) relative to the m-system
C(p_2) and therefore u(a, p_2) = a. Since u(a, p_2) is not right primary Theorem
17, Corollary 2, shows that a is not the intersection of a finite number of right
primary ideals.

Fitting's decomposition theorem [1] represents a as the intersection of two
"primary left ideals", namely,

(xy) = (x) ∩ (y)_l,

where (x) is the two sided ideal generated by x and (y)_l is the left ideal generated
by y. In the present paper, however, we consider only representations as
intersections of two sided right primary ideals.

It can also be shown by examples that the necessary condition given in Theorem
17, Corollary 2, is not sufficient. Let R be the same ring as above and let
a = (x^2, xy). Then a has a unique minimal prime divisor p = (x) and r(a) = (x). But
a is not right primary since xRy ⊆ a while x ∉ a and y ∉ r(a). Now l(a, p), the
set of all elements r such that rRm ⊆ a for some m in C(p), is easily seen to be
equal to (x) and therefore u(a, p) ⊇ (x). But (x) has property (A) relative to
C(p) and therefore u(a, p) = (x). Since u(a, p) is right primary the necessary
condition of Corollary 2 is satisfied. By Theorem 17, in any short representation
of a as the intersection of a finite number of right primary ideals, (x) must
occur as one component. The other components must be sought among the
other right primary divisors of a, namely, (x, y), (x^2, y), (x, y^n), (x^2, xy, y^n) and
(x^2, xy, yx, y^n), n ≥ 2. It is easy to verify that none of the possible finite
intersections is equal to a.

We may note also that although a is not right primary it is left primary since
aRb ⊆ (x^2, xy) and a ∉ (x) together imply b ∈ (x^2, xy).


THEOREM 18. If a = q_1 ∩ q_2 ∩ ... ∩ q_n is a short representation of a as the
intersection of right primary ideals q_1, q_2, ..., q_n then a prime ideal p ≠ R which
divides a is the radical of one of the ideals q_i if and only if p is nrp to u(a, p). The
ring R is the radical of one of the q_i if and only if R is nrp to a.


Proof. (i) Let the radicals of q_1, q_2, ..., q_n be p_1, p_2, ..., p_n. If p = p_i but
p ≠ R, then by Theorem 15,

(4) u(a, p) = q_{i_1} ∩ q_{i_2} ∩ ... ∩ q_{i_k}

where p_{i_1}, p_{i_2}, ..., p_{i_k} are those primes among the p_j which are contained in p.
Now (4) is a short representation of u(a, p) and p is the radical of one of the ideals
q_{i_1}, q_{i_2}, ..., q_{i_k} and contains the radicals of the rest of these. Hence by Theorem
12, an element x is nrp to u(a, p) if and only if x ∈ p. Hence if p = p_i then p
is nrp to u(a, p).

(ii) Now suppose p ⊇ a, p ≠ R, and p is nrp to u(a, p). Since, by Theorem
16, all minimal prime divisors of a are among the primes p_1, p_2, ..., p_n, p must
contain at least one of these. Suppose p contains p_1, p_2, ..., p_k but not p_{k+1},
..., p_n. By Theorem 15,

u(a, p) = q_1 ∩ q_2 ∩ ... ∩ q_k

is a short representation of u(a, p), and since p is nrp to u(a, p) Theorem 12 gives

p ⊆ p_1 ⊕ p_2 ⊕ ... ⊕ p_k

where ⊕ denotes a set-theoretic sum. But since p ⊇ p_i (i = 1, 2, ..., k) it
follows that

(5) p = p_1 ⊕ p_2 ⊕ ... ⊕ p_k.

In the sum (5) any prime p_i which is contained in the sum of the remaining
primes may be omitted. We may assume therefore that

(6) p = p_1 ⊕ p_2 ⊕ ... ⊕ p_l,

where l ≤ k and no one of p_1, ..., p_l is contained in the set-theoretic sum of
the remaining ones.

Now if l > 1 the product of ideals p_1 p_2 ... p_{l-1} cannot be contained in p_l, for
if it were, since p_l is prime, p_l would contain one of the primes p_1, p_2, ..., p_{l-1},
contrary to the assumption of the minimal length of the sum (6). Hence we can
choose elements p_i from the ideals p_i (i = 1, 2, ..., l - 1) such that the product
of elements p_1 p_2 ... p_{l-1} does not belong to the ideal p_l. Moreover, we can
choose an element p_l of the ideal p_l which does not belong to the ideal p_i
for i < l. Form the element x = p_1 p_2 ... p_{l-1} + p_l. Being the sum of two
elements of p, x ∈ p and therefore x belongs to the ideal p_j for some value of j
such that 1 ≤ j ≤ l. But this is impossible, for if j < l then the product
p_1 p_2 ... p_{l-1} lies in the ideal p_j but the element p_l does not, while if j = l,
the element p_l lies in the ideal p_l but the product p_1 p_2 ... p_{l-1} does not.
This contradiction leads to the conclusion that l = 1 and hence p = p_i for some
value of i.

(iii) Suppose R is the radical of one of the q_i, and let it be q_1. Since R is
therefore the only minimal prime divisor of q_1, Theorem 10 gives R^s ⊆ q_1. Choose
an element q which is contained in q_2 ∩ q_3 ∩ ... ∩ q_n but not in q_1 so that q ∉ a.
Then qR^s ⊆ a. Assume s is the least exponent for which this holds, so that
s ≥ 1, and choose an element q' in qR^{s-1} such that q' is not contained in a. Then
q'Rr ⊆ a for all elements r of R and therefore R is nrp to a.

(iv) Conversely, suppose R is nrp to a so that for every element r of R there
is an element a_r not in a such that a_r R r ⊆ a. Hence for each i, a_r R r ⊆ q_i, while
for at least one j, a_r ∉ q_j. Thus, since q_j is right primary, r ∈ p_j and

R = p_1 ⊕ p_2 ⊕ ... ⊕ p_n.

Now let

(7) R = p_1 ⊕ p_2 ⊕ ... ⊕ p_l

be the sum of minimal length which is equal to R. If l > 1 choose elements
p_1, p_2, ..., p_l as above and we find the element p_1 p_2 ... p_{l-1} + p_l belongs to
none of the primes p_1, p_2, ..., p_l, in contradiction to (7). Hence l = 1 and p_i = R
for one value of i. This completes the proof of Theorem 18.

If an ideal a can be represented as the intersection of right primary ideals
q_1, q_2, ..., q_n, Theorem 18 shows that the radicals of these right primary
components are uniquely determined since the criterion given to determine whether
p is one of these radicals or not depends only on p and a. Similarly the number
of right primary components in a short representation is also uniquely
determined as the number of distinct primes among the radicals of q_1, q_2, ..., q_n.
We may therefore summarize the results of this section as follows:


THEOREM 19. Let R be a noncommutative ring in which the ascending chain
condition holds for two sided ideals. If an ideal a in R can be represented as the
intersection of a finite number of right primary ideals then a has a short
representation as such. In any two short representations of a the number of right primary
components is the same and the radicals of the two sets of primary components
coincide in some order. Moreover, the isolated primary components are the same for
all short representations.


Although Theorem 19 shows that the well-known results of E. Noether carry 
over to the noncommutative case for those ideals which can be represented as 
the intersection of a finite number of right primary ideals, a necessary and suffi- 
cient condition that such a representation exist is still unknown. The ascending 
chain condition is not sufficient to ensure this for all ideals as it is in a commuta- 
tive ring. The necessary condition given by Theorem 17, Corollary 2, is not only 
not sufficient but is difficult to apply in a particular case owing to the difficulty 
of finding the ideals u(a,p). It is hoped to return to this problem in a later 
paper. 


REFERENCES

1. H. Fitting, Primärkomponentenzerlegung in nichtkommutativen Ringen, Math. Ann., vol.
111 (1935), 19-41.

2. W. Krull, Zur Theorie der zweiseitigen Ideale in nichtkommutativen Bereichen, Math.
Zeit., vol. 29 (1938), 42-54.

3. W. Krull, Idealtheorie in Ringen ohne Endlichkeitsbedingung, Math. Ann., vol. 101 (1929),
729-744.

4. W. Krull, Idealtheorie, Ergebnisse der Mathematik, vol. 4 (Berlin, 1935).

5. N. H. McCoy, Prime ideals in general rings, Amer. J. Math., vol. 71 (1949), 823-833.

6. N. H. McCoy, Rings and ideals, Carus Mathematical Monographs, No. 8 (Baltimore, 1948).


The University of British Columbia











NOTE ON NORMAL DECIMALS 
H. DAVENPORT AND P. ERDÖS


1. Introduction. A real number, expressed as a decimal, is said to be
normal (in the scale of 10) if every combination of digits occurs in the decimal
with the proper frequency. If a_1 a_2 ... a_k is any combination of k digits, and
N(t) is the number of times this combination occurs among the first t digits, the
condition is that

(1) lim_{t→∞} N(t)/t = 10^{-k}.

It was proved by Champernowne [2] that the decimal .1234567891011 ... is
normal, and by Besicovitch [1] that the same holds for the decimal .1491625 ....
Copeland and Erdös [3] have proved that if p_1, p_2, ... is any sequence of positive
integers such that, for every θ < 1, the number of p's up to n exceeds n^θ if n is
sufficiently large, then the infinite decimal .p_1 p_2 p_3 ... is normal. This includes
the result that the decimal formed from the sequence of primes is normal.

In this note, we prove the following result conjectured by Copeland and
Erdös:


THEOREM 1. Let f(x) be any polynomial in x, all of whose values, for x = 1,
2, ..., are positive integers. Then the decimal .f(1)f(2)f(3) ... is normal.


It is to be understood, of course, that each f(n) is written in the scale of 10,
and that the digits of f(1) are succeeded by those of f(2), and so on. The proof
is based on an interpretation of the condition (1) in terms of the equal
distribution of a sequence to the modulus 1, and the application of the method of
Weyl's famous memoir [6].
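
The frequency condition (1) can be explored numerically for a finite initial segment of such a decimal. The sketch below is only an experiment, not part of the proof: the polynomial f(x) = x^2, the cut-off 20000, the sample blocks, and the helper names `concatenated_digits` and `block_count` are all choices made for this illustration.

```python
def concatenated_digits(f, count):
    """Digits of f(1) f(2) ... f(count), each written in base 10, concatenated."""
    return "".join(str(f(x)) for x in range(1, count + 1))

def block_count(digits, block):
    """Number of (possibly overlapping) occurrences of `block` in `digits`."""
    k = len(block)
    return sum(1 for i in range(len(digits) - k + 1) if digits[i:i + k] == block)

# Initial segment of the decimal .1491625... formed from the squares.
digits = concatenated_digits(lambda x: x * x, 20000)
for block in ("7", "25"):
    observed = block_count(digits, block) / len(digits)
    # Theorem 1 asserts that the observed frequency tends to 10**(-k).
    print(block, observed, 10 ** -len(block))
```

A finite segment can only suggest the limit; the observed frequencies need not be close to 10^{-k} for small cut-offs.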

Besicovitch [1] introduced the concept of the (ε, k) normality of an individual
positive integer q, where ε is a positive number and k is a positive integer. The
condition for this is that if a_1 a_2 ... a_l is any sequence of l digits, where l ≤ k,
then the number of times this sequence occurs in q lies between

(1 - ε) 10^{-l} q' and (1 + ε) 10^{-l} q',

where q' is the number of digits in q. Naturally, the definition is only significant
when q' is large compared with 10^k. We prove:


THEOREM 2. For any ε and k, almost all the numbers f(1), f(2), ... are (ε, k)
normal; that is, the number of numbers n ≤ x for which f(n) is not (ε, k) normal
is o(x) as x → ∞ for fixed ε and k.
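
The definition of (ε, k) normality translates directly into a finite check. The following is a minimal sketch under two assumptions that are not in the text: occurrences are counted with overlaps allowed, and "lies between" is read as inclusive. The function name `is_eps_k_normal` is hypothetical.

```python
from itertools import product

def is_eps_k_normal(q, eps, k):
    """Check (eps, k) normality of the positive integer q in the scale of 10."""
    s = str(q)
    n_digits = len(s)                       # q' in the text
    for l in range(1, k + 1):
        expected = n_digits / 10 ** l       # 10^{-l} q'
        for combo in product("0123456789", repeat=l):
            block = "".join(combo)
            count = sum(1 for i in range(n_digits - l + 1) if s[i:i + l] == block)
            # Assumed inclusive bounds (1 - eps) * expected <= count <= (1 + eps) * expected.
            if not (1 - eps) * expected <= count <= (1 + eps) * expected:
                return False
    return True

print(is_eps_k_normal(1234567890, 0.01, 1))
print(is_eps_k_normal(1111111111, 0.5, 1))
```

The first integer uses each digit exactly once among its ten digits, so it passes for k = 1; the second fails because the digit 0 never occurs.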


Received February 9, 1951. 




This is a stronger result than that asserted in Theorem 1. But the proof of 
Theorem 1 is simpler than that of Theorem 2, and provides a natural intro- 
duction to it. 


2. Proof of Theorem 1. We defined $N(t)$ to be the number of times a particular combination of $k$ digits occurs among the first $t$ digits of a given decimal. More generally, we define $N(u, t)$ to be the number of times this combination occurs among the digits from the $(u+1)$th to the $t$th, so that $N(0, t) = N(t)$. This function is almost additive; we have, for $t > u$,

(2)  $N(u, t) \le N(t) - N(u) \le N(u, t) + (k - 1)$,

the discrepancy arising from the possibility that the combinations counted in $N(t) - N(u)$ may include some which contain both the $u$th and $(u+1)$th digits.
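The almost-additivity (2) is easy to confirm by brute force on a finite digit string. In this sketch the function `N` takes the digit string and block explicitly, which is a notational assumption made for the example.

```python
def N(digits, block, u, t):
    """Occurrences of block lying entirely within places u+1, ..., t (1-indexed)."""
    k = len(block)
    return sum(1 for i in range(u, t - k + 1) if digits[i:i + k] == block)

def check_almost_additive(digits, block):
    """Verify N(u,t) <= N(t) - N(u) <= N(u,t) + (k-1) for all 0 <= u < t."""
    k = len(block)
    for t in range(1, len(digits) + 1):
        for u in range(t):
            diff = N(digits, block, 0, t) - N(digits, block, 0, u)
            middle = N(digits, block, u, t)
            if not middle <= diff <= middle + (k - 1):
                return False
    return True

print(check_almost_additive("1212121231212", "121"))
```

The extra term $k - 1$ accounts for blocks that start at one of the $k - 1$ places immediately before place $u + 1$ and straddle the cut.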


Let $g$ be the degree of the polynomial $f(x)$. For any positive integer $n$, let $x_n$ be the largest integer $x$ for which $f(x)$ has less than $n$ digits. Then, if $n$ is sufficiently large, as we suppose throughout, $f(x_n + 1)$ has $n$ digits, and so have $f(x_n + 2), \ldots, f(x_{n+1})$. It is obvious that

(3)  $x_n \sim \alpha(10^{1/g})^n$ as $n \to \infty$,

where $\alpha$ is a constant.

Suppose that the last digit in $f(x_n)$ occupies the $t_n$th place in the decimal $.f(1)f(2)\ldots$. Then the number of digits in the block

$$f(x_n + 1)f(x_n + 2)\ldots f(x_{n+1})$$

is $t_{n+1} - t_n$, and is also $n(x_{n+1} - x_n)$, since each $f$ has exactly $n$ digits. Hence

(4)  $t_{n+1} - t_n = n(x_{n+1} - x_n)$.

It follows from (3) that

(5)  $t_n \sim \alpha n(10^{1/g})^n$ as $n \to \infty$.
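Relations (3) and (4) can be observed numerically. The sketch below uses the sample polynomial $f(x) = x^2 + 1$ (an assumption for illustration, with $g = 2$) and computes $x_n$ and $t_n$ directly from their definitions.

```python
def f(x):
    return x * x + 1          # hypothetical example polynomial, degree g = 2

def x_n(n):
    """Largest integer x >= 0 for which f(x) has fewer than n digits."""
    x = 0
    while len(str(f(x + 1))) < n:
        x += 1
    return x

def t_n(n):
    """Place occupied by the last digit of f(x_n) in the decimal .f(1)f(2)..."""
    return sum(len(str(f(x))) for x in range(1, x_n(n) + 1))

for n in range(2, 7):
    # relation (4): the block f(x_n + 1)...f(x_{n+1}) consists of n-digit numbers
    assert t_n(n + 1) - t_n(n) == n * (x_n(n + 1) - x_n(n))
    # by (3) the ratio x_n / 10^(n/g) should settle down to a constant
    print(n, x_n(n), t_n(n), x_n(n) / 10 ** (n / 2))
```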
To prove (1), it suffices to prove that

(6)  $N(t_n, t) = 10^{-k}(t - t_n) + o(t_n)$

as $n \to \infty$, for $t_n < t \le t_{n+1}$. For, by (2), we have

$$N(t) - N(t_h) = \sum_{r=h}^{n-1} N(t_r, t_{r+1}) + N(t_n, t) + R,$$

for a suitable fixed $h$, where $|R| < nk$. Since (6) includes as a special case the result

$$N(t_r, t_{r+1}) = 10^{-k}(t_{r+1} - t_r) + o(t_r),$$

we obtain (1).

In proving (6), we can suppose without loss of generality that $t$ differs from $t_n$ by an exact multiple of $n$. Putting $t = t_n + nX$, the number $N(t_n, t)$ is the number of times that the given combination of $k$ digits occurs in the block

(7)  $f(x_n + 1)f(x_n + 2)\ldots f(x_n + X)$,

where $0 < X \le x_{n+1} - x_n$. We can restrict ourselves to those combinations which occur entirely in the same $f(x)$, since the others number at most $(k - 1)(x_{n+1} - x_n)$, which is $o(t_n)$ by (3) and (5).


The number of times that a given combination $a_1a_2\ldots a_k$ of digits occurs in a particular $f(x)$ is the same as the number of values of $m$ with $k \le m \le n$ for which the fractional part of $10^{-m}f(x)$ begins with the decimal $.a_1a_2\ldots a_k$. If we define $\theta(z)$ to be 1 if $z$ is congruent (mod 1) to a number lying in a certain interval of length $10^{-k}$, and 0 otherwise, the number of times the given combination occurs in $f(x)$ is

$$\sum_{m=k}^{n} \theta(10^{-m}f(x)).$$

Hence

$$N(t_n, t) = \sum_{x=x_n+1}^{x_n+X}\ \sum_{m=k}^{n} \theta(10^{-m}f(x)) + O(x_{n+1} - x_n),$$

the error being simply that already mentioned.
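The translation from digit blocks to fractional parts can be checked with exact integer arithmetic. In the sketch below (names are hypothetical), the first $k$ digits of the fractional part of $10^{-m}v$ are read off as $\lfloor (v \bmod 10^m)/10^{m-k} \rfloor$, which avoids floating-point error.

```python
def count_in_string(value, block):
    """Direct count of (overlapping) occurrences of block in the digits of value."""
    digits = str(value)
    k = len(block)
    return sum(1 for i in range(len(digits) - k + 1) if digits[i:i + k] == block)

def count_by_fractional_parts(value, block):
    """Count m with k <= m <= n such that frac(10**-m * value) begins .a_1...a_k."""
    n = len(str(value))
    k = len(block)
    target = int(block)
    hits = 0
    for m in range(k, n + 1):
        fractional_digits = value % 10 ** m           # the m digits after the point
        leading_k = fractional_digits // 10 ** (m - k)
        if leading_k == target:
            hits += 1
    return hits

print(count_in_string(12123, "12"), count_by_fractional_parts(12123, "12"))
```

Both counts agree because each $m$ corresponds to exactly one starting position of a $k$-digit window inside the $n$ digits of the value.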
To prove (6), it suffices to prove that

(8)  $\displaystyle\sum_{m=k}^{n}\ \sum_{x=x_n+1}^{x_n+X} \theta(10^{-m}f(x)) = 10^{-k}nX + o(n(x_{n+1} - x_n))$

for $0 < X \le x_{n+1} - x_n$. We shall prove that if $\delta$ is any fixed positive number, and $\delta n < m < (1 - \delta)n$, then

(9)  $\displaystyle\sum_{x=x_n+1}^{x_n+X} \theta(10^{-m}f(x)) = 10^{-k}X + o(x_{n+1} - x_n)$

uniformly in $m$. This suffices to prove (8), since the contribution of the remaining values of $m$ is at most $2\delta nX$, where $\delta$ is arbitrarily small. We have


(10)  $X \le x_{n+1} - x_n < \alpha(10^{1/g})^{n+1}$,

and we can also suppose that 

(11) X > (x%es1 — x)” > B(10'"")""-™ 

where $\beta$ is a constant, since (9) is trivial if this condition is not satisfied.

The proof of (9) follows well-known lines. One can construct [6; 4, pp. 91-92, 99] for any $\eta > 0$, functions $\theta_1(z)$ and $\theta_2(z)$, periodic in $z$ with period 1, such that $\theta_1(z) \le \theta(z) \le \theta_2(z)$, having Fourier expansions of the form

$$\theta_1(z) = 10^{-k} - \eta + {\sum}' A_\nu\, e(\nu z),$$
$$\theta_2(z) = 10^{-k} + \eta + {\sum}' A_\nu\, e(\nu z).$$

Here the summation is over all integers $\nu$ with $\nu \ne 0$, and $e(w)$ stands for $e^{2\pi i w}$. The coefficients $A_\nu$ are majorized by

$$|A_\nu| \le \min\left(\frac{1}{|\nu|},\ \frac{1}{\eta\nu^2}\right).$$


Using these functions to approximate $\theta(10^{-m}f(x))$ in (9), we see that it will suffice to estimate the sum

$$S_{n,m,\nu} = \sum_{x=x_n+1}^{x_n+X} e(10^{-m}\nu f(x)).$$

We can in fact prove that

(12)  $|S_{n,m,\nu}| < CX^{1-\zeta}$

for all $m$ and $\nu$ satisfying

(13)  $\delta n < m < (1 - \delta)n, \qquad 1 \le \nu < \eta^{-2}$,

where $C$ and $\zeta$ are positive numbers depending only on $\delta$, $\eta$ and on the polynomial $f(x)$. This is amply sufficient to prove (9), since $X \le x_{n+1} - x_n$.

The inequality (12) is a special case of Weyl's inequality for exponential sums. The highest coefficient in the polynomial $10^{-m}\nu f(x)$ is $10^{-m}\nu c/d$, where $c/d$ is the highest coefficient in $f(x)$, and so is a rational number. Write

$$10^{-m}\nu\,\frac{c}{d} = \frac{a}{q},$$

where $a$ and $q$ are relatively prime integers. Let $G = 2^{g-1}$. Then, by Weyl's inequality,$^1$

(14)  $|S_{n,m,\nu}|^G < C_1X^{\epsilon}q^{\epsilon}\,(X^{G-1} + X^{G}q^{-1} + X^{G-g}q)$

for any $\epsilon > 0$, where $C_1$ depends only on $g$ and $\epsilon$. In the present case, we have

$$q \le 10^{m}d < 10^{(1-\delta)n}d,$$

and

$$q \ge 10^{m}\nu^{-1}c^{-1} > 10^{\delta n}\eta^{2}c^{-1}.$$


This relates the magnitude of $q$ to that of $n$. Relations between $n$ and $X$ were given in (10) and (11), and it follows that


G 7g < q < ore. 


where $C_2$ and $C_3$ depend only on $\eta$, $c$, $d$, and $g$. Using these inequalities for $q$ in (14), we obtain a result of the form (12).


3. Proof of Theorem 2. We again consider the values of $x$ for which $f(x)$ has exactly $n$ digits, namely those for which $x_n < x \le x_{n+1}$. We denote by $T(x)$ the number of times that a particular digit combination $a_1a_2\ldots a_l$ (where $l \le k$) occurs in $f(x)$. Then, with the previous notation,

$$T(x) = \sum_{m=l}^{n} \theta(10^{-m}f(x)).$$

$^1$The most accessible reference is [5, Satz 267]. The result is stated there for a polynomial with one term, but the proof applies generally.
















We proved earlier that (putting $X = x_{n+1} - x_n$),

$$\sum_{x=x_n+1}^{x_n+X} T(x) \sim 10^{-l}nX \quad\text{as } n \to \infty.$$

Now our object is a different one; we wish to estimate the number of values of $x$ for which $T(x)$ deviates appreciably from its average value, which is $10^{-l}n$. For this purpose, we shall prove that

(15)  $\displaystyle\sum_{x=x_n+1}^{x_n+X} T^2(x) \sim 10^{-2l}n^2X$ as $n \to \infty$.

When this has been proved, Theorem 2 will follow. For then

$$\sum_{x=x_n+1}^{x_n+X} (T(x) - 10^{-l}n)^2 = \sum T^2(x) - 2(10^{-l}n)\sum T(x) + 10^{-2l}n^2X = o(10^{-2l}n^2X) \quad\text{as } n \to \infty.$$

Hence the number of values of $x$ with $x_n < x \le x_{n+1}$, for which the combination $a_1a_2\ldots a_l$ does not occur between $(1 - \epsilon)10^{-l}n$ and $(1 + \epsilon)10^{-l}n$ times, is $o(x_{n+1} - x_n)$ for any fixed $\epsilon$. Since this is true for each combination of at most $k$ digits, it follows that $f(x)$ is $(\epsilon, k)$ normal for almost all $x$.

To prove (15), we write the sum on the left as

(16)  $\displaystyle\sum_{x=x_n+1}^{x_n+X}\ \sum_{m_1=l}^{n}\ \sum_{m_2=l}^{n} \theta(10^{-m_1}f(x))\,\theta(10^{-m_2}f(x))$.

Once again, we can restrict ourselves to values of $m_1$ and $m_2$ which satisfy

(17)  $\delta n < m_1 < (1 - \delta)n, \qquad \delta n < m_2 < (1 - \delta)n$,

since the contribution of the remaining terms is small compared with the right hand side of (15) when $\delta$ is small. For a similar reason, we can impose the restriction that

(18)  $m_2 - m_1 > \delta n$.


Proceeding as before, and using the functions $\theta_1(z)$ and $\theta_2(z)$, we find that it suffices to estimate the sum

(19)  $\displaystyle S(n, m_1, m_2, \nu_1, \nu_2) = \sum_{x=x_n+1}^{x_n+X} e((10^{-m_1}\nu_1 + 10^{-m_2}\nu_2)f(x))$,

for values of $\nu_1$ and $\nu_2$ which are not both zero, and satisfy $|\nu_1| < \eta^{-2}$, $|\nu_2| < \eta^{-2}$. If either $\nu_1$ or $\nu_2$ is zero, the previous result (12) applies. Supposing neither zero, we write the highest coefficient again as

$$(10^{-m_1}\nu_1 + 10^{-m_2}\nu_2)\frac{c}{d} = \frac{a}{q}.$$

In view of (17) and (18), we have 
q = 10d < 10°°""d < C3X°°""d. 





We observe that $a$ cannot be zero, since

$$10^{-m_2}|\nu_2| < 10^{-m_1-\delta n}|\nu_2| < \tfrac{1}{2}\,10^{-m_1}|\nu_1|$$

provided that $2\eta^{-2} < 10^{\delta n}$, which is so for large $n$. Hence


¢> = 10" |yi|-tc > CX”. 


It now follows as before from Weyl's inequality that

$$|S(n, m_1, m_2, \nu_1, \nu_2)| < CX^{1-\zeta},$$

where again $C$ and $\zeta$ are positive numbers depending only on $\delta$, $\eta$, and the polynomial $f(x)$. Using this in (16), we obtain (15).


REFERENCES 


1. A. S. Besicovitch, The asymptotic distribution of the numerals in the decimal representation of the squares of the natural numbers, Math. Zeit., vol. 39 (1934), 146-156.

2. D. G. Champernowne, The construction of decimals normal in the scale of ten, J. London Math. Soc., vol. 8 (1933), 254-260.

3. A. H. Copeland and P. Erdős, Note on normal numbers, Bull. Amer. Math. Soc., vol. 52 (1946), 857-860.

4. J. F. Koksma, Diophantische Approximationen (Ergebnisse der Math., IV, 4; Berlin, 1936).

5. E. Landau, Vorlesungen über Zahlentheorie (Leipzig, 1927).

6. H. Weyl, Über die Gleichverteilung von Zahlen mod. Eins, Math. Ann., vol. 77 (1916), 313-352.


University College, London 
The University, Aberdeen 














ON PRODUCTS OF SETS OF GROUP ELEMENTS 
HENRY B. MANN 


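Let $\mathfrak{A} = \{A_1, A_2, \ldots\}$, $\mathfrak{B} = \{B_1, B_2, \ldots\}$ be sets of elements of a group $\mathfrak{G}$ of finite order $g$. We define

$$\mathfrak{C} = \mathfrak{A}\mathfrak{B} = \{A_iB_j\}.$$

By $(\mathfrak{A})$, $(\mathfrak{B})$, ... we shall denote the number of elements in $\mathfrak{A}$, $\mathfrak{B}$, ... respectively and by $\overline{\mathfrak{A}}$, $\overline{\mathfrak{B}}$, ... the sets of elements of $\mathfrak{G}$ not in $\mathfrak{A}$, $\mathfrak{B}$, ....

THEOREM 1. Either $\mathfrak{A}\mathfrak{B} = \mathfrak{G}$ or $g \ge (\mathfrak{A}) + (\mathfrak{B})$.

Proof. Let $C$ be an element not in $\mathfrak{C} = \mathfrak{A}\mathfrak{B}$. Let $A$, $B$, ... be a generic notation for elements in $\mathfrak{A}$, $\mathfrak{B}$, ... respectively. All $A$ are different from all $CB^{-1}$, for otherwise $C = AB$. Thus there are at least $(\mathfrak{A}) + (\mathfrak{B})$ elements in $\mathfrak{G}$.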


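THEOREM 2. Let $\mathfrak{A}$, $\mathfrak{B}$ be sets of elements of an Abelian group $\mathfrak{G}$ and let $C \subset \overline{\mathfrak{A}\mathfrak{B}}$. Then there exists a $\mathfrak{B}^* \supset \mathfrak{B}$ such that

(i) $\overline{\mathfrak{C}^*} = \overline{\mathfrak{A}\mathfrak{B}^*} = C\mathfrak{H}$, where $\mathfrak{H}$ is a subgroup of $\mathfrak{G}$,

(ii) $(\mathfrak{A}\mathfrak{B}^*) - (\mathfrak{A}\mathfrak{B}) = (\mathfrak{B}^*) - (\mathfrak{B})$.

We shall give the proof by induction on the number of elements in $\overline{\mathfrak{C}}$. Clearly Theorem 2 holds with $\mathfrak{H} = I$ the identity if $\overline{\mathfrak{C}}$ consists only of one element $C$. Now let $\overline{\mathfrak{C}}$ consist of the elements $C = C_0, C_1, \ldots, C_r$. Form the products $CC_i^{-1} = D_i$ and let $\mathfrak{H}$ be the subgroup generated by the $D_i$. Two cases arise.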


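First case. For every $i$ and $k$ we have for some $m$

$$C_kD_i^{-1} = C_m.$$

Since $C_i = CD_i^{-1}$ it then follows that for every $H \subset \mathfrak{H}$ we have for some $m$

$$CH = C_m.$$

Since $CD_m^{-1} = C_m$, so that $C_m = CH$ for every $m$, it follows that $\overline{\mathfrak{C}} = C\mathfrak{H}$.

Second case. There exist an $i$ and a $k$ such that

$$C_kD_i^{-1} = AE, \qquad E \subset \mathfrak{B}.$$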


We then form the set $\mathfrak{B}_1$ consisting of all elements of the form $ED_i$ which satisfy an equation


Received August 2, 1950. 
for some $t$. Equation (1) implies also

(1')  $AED_i = C_t$.
We shall prove: 


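PROPOSITION 1. No element of $\mathfrak{B}_1$ is in $\mathfrak{B}$. This follows easily since no element in $\mathfrak{B}$ can satisfy an equation of the form (1).

PROPOSITION 2. Let $\mathfrak{B} \cup \mathfrak{B}_1 = \mathfrak{B}_1^*$; then $\overline{\mathfrak{C}_1} = \overline{\mathfrak{A}\mathfrak{B}_1^*} \supset C$. Otherwise we should have $AED_i = C$, $AE = C_i$, which is impossible since $E \subset \mathfrak{B}$ but $C_i \not\subset \mathfrak{A}\mathfrak{B}$.

PROPOSITION 3. $(\mathfrak{A}\mathfrak{B}_1^*) - (\mathfrak{A}\mathfrak{B}) = (\mathfrak{B}_1^*) - (\mathfrak{B}) = (\mathfrak{B}_1)$.

Equations (1) and (1') show that $ED_i$ is in $\mathfrak{B}_1$ if and only if $C_t \subset \mathfrak{C}_1 = \mathfrak{A}\mathfrak{B}_1^*$, which proves Proposition 3.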


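Since $(\overline{\mathfrak{C}_1}) < (\overline{\mathfrak{C}})$ there exists by induction a set $\mathfrak{B}^* \supset \mathfrak{B}_1^* \supset \mathfrak{B}$ such that $\overline{\mathfrak{A}\mathfrak{B}^*} = C\mathfrak{H}$ where $\mathfrak{H}$ is a subgroup of $\mathfrak{G}$ and such that

$$(\mathfrak{A}\mathfrak{B}^*) - (\mathfrak{A}\mathfrak{B}_1^*) = (\mathfrak{B}^*) - (\mathfrak{B}_1^*).$$

Adding this equation to Proposition 3 we obtain Theorem 2.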


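COROLLARY (Davenport and Chowla). Let $\mathfrak{G}$ be the additive group of residues mod $N$. Let $\mathfrak{A} = \{a_0 = 0, a_1, \ldots, a_m\}$, $\mathfrak{B} = \{b_1, \ldots, b_n\}$ be sets of residues mod $N$ such that $(a_i, N) = 1$ for $i > 0$. Let $\mathfrak{C} = \mathfrak{A}\mathfrak{B}$. Then either $\mathfrak{C} = \mathfrak{G}$ or

(2)  $(\mathfrak{C}) \ge m + n = (\mathfrak{A}) + (\mathfrak{B}) - 1$.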


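Proof. By Theorems 1 and 2 it is sufficient to prove the Corollary for the case that $\overline{\mathfrak{C}} = C\mathfrak{H}$ where $\mathfrak{H}$ is a subgroup of $\mathfrak{G}$. Consider the factor group $\mathfrak{G}/\mathfrak{H}$. Let $\mathfrak{A}'$, $\mathfrak{B}'$ be the sets of cosets mod $\mathfrak{H}$ that contain elements of $\mathfrak{A}$ and $\mathfrak{B}$ respectively. Let $t$ be the index and $h$ the order of $\mathfrak{H}$. By Theorem 1,

$$t \ge (\mathfrak{A}') + (\mathfrak{B}').$$

Hence

(3)  $N = ht \ge h(\mathfrak{A}') + h(\mathfrak{B}')$.

Since $a_0 \subset \mathfrak{H}$, $a_i \not\subset \mathfrak{H}$ for $i > 0$, we have

$$h(\mathfrak{A}') - h \ge m, \qquad h(\mathfrak{B}') \ge n.$$

Substituting this in (3) we obtain

$$N \ge m + n + h, \qquad (\mathfrak{C}) = N - h \ge m + n.$$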
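The Corollary can be tested exhaustively for small moduli. The sketch below writes the group additively, so that $\mathfrak{A}\mathfrak{B}$ becomes the set of sums $a + b$ mod $N$; the function names are hypothetical and the search is brute force, intended only as a sanity check.

```python
from itertools import combinations
from math import gcd

def sumset(A, B, N):
    """The set AB of the text, written additively: all a + b mod N."""
    return {(a + b) % N for a in A for b in B}

def check_corollary(N, m, n):
    """Check (C) = N or (C) >= m + n for every admissible pair of sets.

    A = {0, a_1, ..., a_m} with gcd(a_i, N) = 1, and B any n residues mod N.
    """
    units = [a for a in range(1, N) if gcd(a, N) == 1]
    for rest in combinations(units, m):
        A = (0,) + rest
        for B in combinations(range(N), n):
            size = len(sumset(A, B, N))
            if size != N and size < m + n:
                return False
    return True

print(check_corollary(8, 2, 3), check_corollary(9, 2, 3))
# Without the coprimality hypothesis the bound can fail:
print(len(sumset({0, 3}, {0, 3}, 6)))   # 2, although m + n = 3
```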


The Corollary to Theorem 2 was proved by Davenport [2] for the case that $N$ is a prime. Chowla [1] used Davenport's methods to obtain the Corollary in its general form. Davenport later discovered that for the case when $N$ is a prime the Corollary was already known to Cauchy [3].













It is interesting to note that the proof of Theorem 2 is closely related to the 
author’s proof of the fundamental theorem on the density of sums of sets of 
positive integers [4]. Thus the similarity between this theorem and the 
theorem of Davenport and Chowla is not as superficial as might have appeared. 


REFERENCES 


1. I. Chowla, Proc. Indian Acad. Sci., vol. 2 (1935), 242-243. 
2. H. Davenport, J. Lond. Math. Soc., vol. 10 (1935), 30-32.
3. H. Davenport, J. Lond. Math. Soc., vol. 22 (1947), 100-101.
4. H. B. Mann, Ann. of Math., vol. 43 (1942), 523-527. 





Ohio State University 


THE FOURIER COEFFICIENTS OF THE MODULAR
FUNCTION $\lambda(\tau)$


WILLIAM H. SIMONS 


1. Introduction. In [3], H. Rademacher obtained a convergent series for the Fourier coefficients of the modular invariant $J(\tau)$. He found that in the expansion

$$12^3J(\tau) = e^{-2\pi i\tau} + \sum_{m=0}^{\infty} c_m e^{2\pi i m\tau}$$

the coefficients $c_m$, for $m \ge 1$, are given by

(1)  $\displaystyle c_m = \frac{2\pi}{\sqrt{m}} \sum_{k=1}^{\infty} \frac{A_k(m)}{k}\, I_1\!\left(\frac{4\pi\sqrt{m}}{k}\right)$,

where

$$A_k(m) = {\sum_{h \bmod k}}' \exp\left\{-\frac{2\pi i}{k}(mh + h')\right\}, \qquad hh' \equiv -1 \pmod{k},$$

and $I_1(z)$ is the Bessel function of the first order with purely imaginary argument. The $\sum'$ above indicates the sum with respect to $h$ from 0 to $k - 1$ with $(h, k) = 1$.
The purpose of this paper is to discuss the Fourier coefficients of $\lambda(\tau)$, the fundamental modular function of level (Stufe) 2. It may be defined either in terms of theta-functions by

(2)  $\displaystyle \lambda(\tau) = \left[\frac{\theta_2(0|\tau)}{\theta_3(0|\tau)}\right]^4 = 16q \prod_{n=1}^{\infty} \left(\frac{1 + q^{2n}}{1 + q^{2n-1}}\right)^8 = 16q[1 - 8q + 44q^2 - \ldots], \qquad q = e^{\pi i\tau}$,
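or by the equivalent definition

(3)  $\displaystyle \lambda(\tau) = \frac{e_2 - e_3}{e_1 - e_3}$,

where $e_1, e_2, e_3$ are given in terms of the Weierstrass elliptic function $\wp(z)$ and its periods $2\omega_1$, $2\omega_2$ by

$$e_1 = \wp(\omega_1), \qquad e_2 = \wp(\omega_1 + \omega_2), \qquad e_3 = \wp(\omega_2).$$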
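The coefficients of the expansion in (2) can be generated exactly from the infinite product with integer arithmetic. The following sketch (function name hypothetical) multiplies and divides truncated power series by the factors $1 + q^j$; the value it yields for $a_{16}$ can be compared with the one quoted in the numerical example of §7.

```python
def lambda_coefficients(M):
    """Return [a_1, ..., a_M] where lambda(tau) = sum a_n q^n, q = exp(pi*i*tau).

    Uses lambda = 16 q * prod_{n>=1} ((1 + q^(2n)) / (1 + q^(2n-1)))^8,
    truncating every power series at q^(M-1).
    """
    series = [0] * M
    series[0] = 1
    for j in range(1, M):
        for _ in range(8):
            if j % 2 == 0:
                # multiply by (1 + q^j): update from high to low degree
                for i in range(M - 1, j - 1, -1):
                    series[i] += series[i - j]
            else:
                # divide by (1 + q^j): new[i] = old[i] - new[i - j]
                for i in range(j, M):
                    series[i] -= series[i - j]
    return [16 * c for c in series]

coeffs = lambda_coefficients(16)
print(coeffs[:4], coeffs[15])
```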


The function $\lambda(\tau)$ is invariant under the substitutions of the congruence subgroup $\Gamma(2)$ of the full modular group defined by all substitutions


Received August 25, 1950. This paper was prepared for publication while the author was a 
member of the Summer Research Institute of the Canadian Mathematical Congress. 













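$$\tau' = \frac{a\tau + b}{c\tau + d},$$

where $a, b, c, d$ are integers with

$$\begin{pmatrix} a & b \\ c & d \end{pmatrix} \equiv \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} \pmod{2} \quad\text{and}\quad \begin{vmatrix} a & b \\ c & d \end{vmatrix} = 1.$$

For the expansion

$$\lambda(\tau) = \sum_{n=1}^{\infty} a_n q^n, \qquad q = e^{\pi i\tau},$$

it is found that

(4)  $\displaystyle a_m = \frac{\pi}{8\sqrt{m}} \sum_{\substack{k=1 \\ k \equiv 2 \ (\mathrm{mod}\ 4)}}^{\infty} \frac{A_k(m)}{k}\, I_1\!\left(\frac{4\pi\sqrt{m}}{k}\right)$.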


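Moreover, it is found that the coefficients in the expansion of the reciprocal function

$$\mu(\tau) = \frac{1}{\lambda(\tau)} = \frac{1}{16}e^{-\pi i\tau} + \sum_{n=0}^{\infty} b_n e^{\pi i n\tau}$$

are given by

(5)  $\displaystyle b_m = \frac{\pi}{8\sqrt{m}} \sum_{\substack{k=1 \\ k \equiv 0 \ (\mathrm{mod}\ 4)}}^{\infty} \frac{A_k(m)}{k}\, I_1\!\left(\frac{4\pi\sqrt{m}}{k}\right)$.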

The method is essentially the same as that used by Rademacher. In §2 the transformation equations for $\lambda(\tau)$ are derived. The main result (4) is obtained in §§3 to 7, and equation (5) is derived in §8.

The following interesting comment was made by the referee of this paper. "The function $j(\tau)$ is determined essentially by its pole at $\tau = \infty$; it is regular everywhere else. But $1/j(\tau)$ has a pole at an interior point of the upper half-plane, and so its Fourier coefficients cannot be determined in as simple a manner. This situation is unavoidable with functions of the full modular group, which has but one parabolic cusp. On the other hand, the subgroup which Dr. Simons treats has 3 parabolic cusps, so it is possible to define functions which together with their reciprocals are regular in the upper half-plane by merely placing the zero and the pole at the cusps of the fundamental region. $\lambda(\tau)$ is such a function. It is of interest to note that both for $\lambda(\tau)$ and $1/\lambda(\tau)$, the Fourier coefficients are given by series which, apart from a trivial numerical factor, are composed of terms taken from the series for $j(\tau)$."


2. The transformation equations. 


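LEMMA 1. Let $a, b, c, d$ be integers with $ad - bc = 1$, and let

$$T = \frac{a\tau + b}{c\tau + d}.$$

Then $\lambda(T)$ and $\lambda(\tau)$ are related as follows:

Case    $\begin{pmatrix} a & b \\ c & d \end{pmatrix}$ (mod 2)    $\lambda(T)$
1°      $\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$      $\lambda(\tau)$
2°      $\begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}$      $\lambda(\tau)/(\lambda(\tau) - 1)$
3°      $\begin{pmatrix} 1 & 0 \\ 1 & 1 \end{pmatrix}$      $1/\lambda(\tau)$
4°      $\begin{pmatrix} 0 & 1 \\ 1 & 1 \end{pmatrix}$      $1/(1 - \lambda(\tau))$
5°      $\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$      $1 - \lambda(\tau)$
6°      $\begin{pmatrix} 1 & 1 \\ 1 & 0 \end{pmatrix}$      $1 - 1/\lambda(\tau)$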




















The lemma is an immediate consequence of the transformation equations for the theta-functions and definition (2), or of the transformation equations for $e_1, e_2, e_3$ and definition (3) [cf. 5].


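LEMMA 2.

$$\lambda(2\tau) = \left[\frac{\{1 - \lambda(\tau)\}^{1/2} - 1}{\{1 - \lambda(\tau)\}^{1/2} + 1}\right]^2.$$

By definition,

$$\lambda(2\tau) = \frac{\theta_2^4(0|2\tau)}{\theta_3^4(0|2\tau)}.$$

But [5, p. 268],

$$2\theta_2^2(0|2\tau) = \theta_3^2(0|\tau) - \theta_4^2(0|\tau),$$
$$2\theta_3^2(0|2\tau) = \theta_3^2(0|\tau) + \theta_4^2(0|\tau),$$

where

$$\theta_4(0|\tau) = \sum_{n=-\infty}^{\infty} (-1)^n q^{n^2} = 1 - 2q + 2q^4 - \ldots.$$

Therefore, writing $\theta_3$, $\theta_4$ for $\theta_3(0|\tau)$, $\theta_4(0|\tau)$,

$$\lambda(2\tau) = \frac{\theta_3^4 - 2\theta_3^2\theta_4^2 + \theta_4^4}{\theta_3^4 + 2\theta_3^2\theta_4^2 + \theta_4^4},$$

$$\frac{\lambda(2\tau) + 1}{1 - \lambda(2\tau)} = \frac{\theta_3^4 + \theta_4^4}{2\theta_3^2\theta_4^2},$$

$$\left\{\frac{\lambda(2\tau) + 1}{1 - \lambda(2\tau)}\right\}^2 = \frac{1}{4}\left\{\frac{\theta_3^4}{\theta_4^4} + 2 + \frac{\theta_4^4}{\theta_3^4}\right\}.$$

Now

$$\frac{\theta_4^4}{\theta_3^4} = 1 - \frac{\theta_2^4}{\theta_3^4} = 1 - \lambda(\tau),$$

and therefore

$$\left\{\frac{\lambda(2\tau) + 1}{1 - \lambda(2\tau)}\right\}^2 = \frac{1}{4}\left\{\frac{1}{1 - \lambda(\tau)} + 2 + 1 - \lambda(\tau)\right\} = \frac{\{2 - \lambda(\tau)\}^2}{4\{1 - \lambda(\tau)\}},$$

so that

$$\frac{\lambda(2\tau) + 1}{1 - \lambda(2\tau)} = \frac{2 - \lambda(\tau)}{2\{1 - \lambda(\tau)\}^{1/2}}.$$

Solving for $\lambda(2\tau)$ gives

$$\lambda(2\tau) = \frac{2 - \lambda(\tau) - 2\{1 - \lambda(\tau)\}^{1/2}}{2 - \lambda(\tau) + 2\{1 - \lambda(\tau)\}^{1/2}} = \left[\frac{\{1 - \lambda(\tau)\}^{1/2} - 1}{\{1 - \lambda(\tau)\}^{1/2} + 1}\right]^2.$$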
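Lemma 2 can be checked numerically on the imaginary axis, where $q = e^{\pi i\tau}$ is real and $0 < \lambda(\tau) < 1$, so that the positive square root is the relevant one. The sketch below sums the theta series directly; replacing $\tau$ by $2\tau$ corresponds to replacing $q$ by $q^2$. The function names and the sample point are assumptions made for illustration.

```python
import math

def theta2(q, terms=60):
    """theta_2(0|tau) = 2 * sum_{n>=0} q^((n + 1/2)^2) for real 0 < q < 1."""
    return 2 * sum(q ** ((n + 0.5) ** 2) for n in range(terms))

def theta3(q, terms=60):
    """theta_3(0|tau) = 1 + 2 * sum_{n>=1} q^(n^2) for real 0 < q < 1."""
    return 1 + 2 * sum(q ** (n * n) for n in range(1, terms))

def lam(q):
    """lambda as a function of q = exp(pi*i*tau)."""
    return (theta2(q) / theta3(q)) ** 4

def lam_doubled_via_lemma(q):
    """Right-hand side of Lemma 2, expressed through lambda(tau)."""
    root = math.sqrt(1 - lam(q))
    return ((root - 1) / (root + 1)) ** 2

q = math.exp(-math.pi * 0.7)      # tau = 0.7i, a hypothetical sample point
print(lam(q * q), lam_doubled_via_lemma(q))
```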
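THEOREM 1. Let $k$ be an even integer and $h$ and $h'$ be integers such that $(h, k) = 1$, and $hh' \equiv -1 \pmod{k}$. Further, let

$$\tau = 2\left(\frac{h}{k} + \frac{iz}{k}\right) \quad\text{and}\quad T = 2\left(\frac{h'}{k} + \frac{i}{kz}\right).$$

Then

$$\lambda(\tau) = \begin{cases} \lambda(T) & \text{if } k \equiv 0 \pmod{4}, \\ 1/\lambda(T) & \text{if } k \equiv 2 \pmod{4}. \end{cases}$$

Proof. Define

$$\begin{pmatrix} a & b \\ c & d \end{pmatrix} = \begin{pmatrix} h' & -2(1 + hh')/k \\ k/2 & -h \end{pmatrix}.$$

Then $a, b, c, d$ are integers with $ad - bc = 1$, and

$$T = \frac{a\tau + b}{c\tau + d}.$$

If $k \equiv 0 \pmod{4}$, then

$$\begin{pmatrix} a & b \\ c & d \end{pmatrix} \equiv \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} \pmod{2},$$

and so by Lemma 1, case 1°, $\lambda(T) = \lambda(\tau)$. If $k \equiv 2 \pmod{4}$, then

$$\begin{pmatrix} a & b \\ c & d \end{pmatrix} \equiv \begin{pmatrix} 1 & 0 \\ 1 & 1 \end{pmatrix} \pmod{2},$$

and so by Lemma 1, case 3°, $\lambda(T) = 1/\lambda(\tau)$.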


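THEOREM 2. Let $k$ be an odd integer and let $h$ and $h'$ be integers such that $(h, k) = 1$ and $hh' \equiv -1 \pmod{k}$. Further, let

$$\tau = \frac{h}{k} + \frac{iz}{k}, \qquad T = \frac{h'}{k} + \frac{i}{kz}.$$

Then

$$\lambda(2\tau) = \begin{cases} \left[\dfrac{\{\lambda(T) - 1\}^{1/2} - \{\lambda(T)\}^{1/2}}{\{\lambda(T) - 1\}^{1/2} + \{\lambda(T)\}^{1/2}}\right]^2 & \text{if } h' \equiv 1 \pmod{2}, \\[2ex] \left[\dfrac{\{\lambda(T)\}^{1/2} - 1}{\{\lambda(T)\}^{1/2} + 1}\right]^2 & \text{if } h' \equiv 0 \pmod{2}. \end{cases}$$

Proof. Define

$$\begin{pmatrix} a & b \\ c & d \end{pmatrix} = \begin{pmatrix} h' & -(1 + hh')/k \\ k & -h \end{pmatrix}.$$

Then $a, b, c, d$ are integers with $ad - bc = 1$ and

$$T = \frac{a\tau + b}{c\tau + d}.$$

(a). Let $h' \equiv 1 \pmod{2}$ and $h \equiv 1 \pmod{2}$. Then

$$\begin{pmatrix} a & b \\ c & d \end{pmatrix} \equiv \begin{pmatrix} 1 & 0 \\ 1 & 1 \end{pmatrix} \pmod{2},$$

and so by Lemma 1, case 3°, $\lambda(\tau) = 1/\lambda(T)$. Substituting for $\lambda(\tau)$ in Lemma 2 gives

$$\lambda(2\tau) = \left[\frac{\{1 - 1/\lambda(T)\}^{1/2} - 1}{\{1 - 1/\lambda(T)\}^{1/2} + 1}\right]^2 = \left[\frac{\{\lambda(T) - 1\}^{1/2} - \{\lambda(T)\}^{1/2}}{\{\lambda(T) - 1\}^{1/2} + \{\lambda(T)\}^{1/2}}\right]^2.$$

(b). Let $h' \equiv 1 \pmod{2}$ and $h \equiv 0 \pmod{2}$. Then

$$\begin{pmatrix} a & b \\ c & d \end{pmatrix} \equiv \begin{pmatrix} 1 & 1 \\ 1 & 0 \end{pmatrix} \pmod{2},$$

and so by Lemma 1, case 6°, $\lambda(\tau) = 1/(1 - \lambda(T))$. Substituting for $\lambda(\tau)$ in Lemma 2 gives

$$\lambda(2\tau) = \left[\frac{\{\lambda(T)\}^{1/2} - \{\lambda(T) - 1\}^{1/2}}{\{\lambda(T)\}^{1/2} + \{\lambda(T) - 1\}^{1/2}}\right]^2.$$

(c). Let $h' \equiv 0 \pmod{2}$ and $h \equiv 1 \pmod{2}$. Then

$$\begin{pmatrix} a & b \\ c & d \end{pmatrix} \equiv \begin{pmatrix} 0 & 1 \\ 1 & 1 \end{pmatrix} \pmod{2},$$

and so by Lemma 1, case 4°, $\lambda(\tau) = 1 - 1/\lambda(T)$. Substituting in Lemma 2 gives

$$\lambda(2\tau) = \left[\frac{1 - \{\lambda(T)\}^{1/2}}{1 + \{\lambda(T)\}^{1/2}}\right]^2.$$

(d). Let $h' \equiv 0 \pmod{2}$ and $h \equiv 0 \pmod{2}$. Then

$$\begin{pmatrix} a & b \\ c & d \end{pmatrix} \equiv \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} \pmod{2},$$

and so by Lemma 1, case 5°, $\lambda(\tau) = 1 - \lambda(T)$. Substituting in Lemma 2 then gives

$$\lambda(2\tau) = \left[\frac{\{\lambda(T)\}^{1/2} - 1}{\{\lambda(T)\}^{1/2} + 1}\right]^2.$$

By combining the results of (a) with (b) and those of (c) with (d) the result of the theorem is obtained.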


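3. The Farey dissection. Let

$$\lambda(\tau) = f(q) = \sum_{n=1}^{\infty} a_n q^n, \qquad q = e^{\pi i\tau}.$$

Then by Cauchy's theorem,

$$a_m = \frac{1}{2\pi i} \int_C \frac{f(q)}{q^{m+1}}\, dq,$$

where the integration is in the positive sense around the circle $C$ defined by

$$|q| = e^{-2\pi N^{-2}},$$

$N$ being a positive integer. Using the Farey dissection of order $N$ of the circle $C$, the integral may be expressed by the sum

$$a_m = \frac{1}{2\pi i} \sum_{\substack{0 \le h < k \le N \\ (h,k)=1}} \int_{\xi_{h,k}} \frac{f(q)}{q^{m+1}}\, dq,$$

where $\xi_{h,k}$ is the Farey arc corresponding to the fraction $h/k$ in the Farey series of order $N$, and

$$q = \exp\left(-2\pi N^{-2} + 2\pi i\frac{h}{k} + 2\pi i\varphi\right).$$

Then

$$a_m = \sum_{\substack{0 \le h < k \le N \\ (h,k)=1}} \int_{-\vartheta'}^{\vartheta''} \frac{f\left(\exp\left\{-2\pi N^{-2} + 2\pi i\frac{h}{k} + 2\pi i\varphi\right\}\right)}{\exp\left(-2\pi mN^{-2} + 2\pi i m\frac{h}{k} + 2\pi i m\varphi\right)}\, d\varphi,$$

where

(6)  $\displaystyle \vartheta' = \frac{h}{k} - \frac{h + h_1}{k + k_1} = \frac{1}{k(k + k_1)}, \qquad \vartheta'' = \frac{h + h_2}{k + k_2} - \frac{h}{k} = \frac{1}{k(k + k_2)}$,

$h_1/k_1$, $h/k$, $h_2/k_2$ being three consecutive terms of the Farey series of order $N$. For convenience the double sum over $0 \le h < k \le N$ with $(h, k) = 1$ will be denoted by

$$\sum_{h,k}^{N}.$$

Then

$$a_m = \exp(2\pi mN^{-2}) \sum_{h,k}^{N} \exp\left(-2\pi i m\frac{h}{k}\right) \int_{-\vartheta'}^{\vartheta''} f\left(\exp\left\{-2\pi N^{-2} + 2\pi i\frac{h}{k} + 2\pi i\varphi\right\}\right) \exp(-2\pi i m\varphi)\, d\varphi$$

$$= \exp(2\pi mN^{-2}) \sum_{h,k}^{N} \exp\left(-2\pi i m\frac{h}{k}\right) \int_{-\vartheta'}^{\vartheta''} f\left(\exp\left\{2\pi i\left(\frac{h}{k} + \frac{iz}{k}\right)\right\}\right) \exp(-2\pi i m\varphi)\, d\varphi,$$

where $z = k(N^{-2} - i\varphi)$.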

Now let the above summation be broken up into three sums $\Sigma_1$, $\Sigma_2$, $\Sigma_3$, the first consisting of those terms for which $k \equiv 1 \pmod{2}$, the second those for which $k \equiv 2 \pmod{4}$, and the third those for which $k \equiv 0 \pmod{4}$, and let $I_1$, $I_2$, and $I_3$ be the parts of $a_m$ corresponding to $\Sigma_1$, $\Sigma_2$, $\Sigma_3$ respectively. Thus

$$a_m = I_1 + I_2 + I_3.$$

4. Evaluation of the integral $I_3$.


N kl 
I; = exp (2amN~*) > yi exp (- 2nimt) 


sutimed 4) @, Del 


. [A exp Jani # + is)t) exp ( — 2xim¢)d¢. 


Applying the transformation equation of Theorem 1, for $k \equiv 0 \pmod{4}$, gives


Is = exp ( — 2emN*) = exp (- 2ximt) 


autimed 4) an ny 1 
. ¢ A x {2 {¥ + i) xp ( — 2ximd)do 
e exp )2mi\7 ke ex] T " 


f(g) = A(r) = ey Ong’, q = exp mtr, 


But 


and so, substituting for $f(q)$, rearranging terms, and putting $w = N^{-2} - i\varphi$, $z = kw$, gives


N 1 oe’ f2xi : ) _ 
L=> > Do an EXP inh - mh) ¢ exp | 27mw — ip dd. 
k=l h=0 —?’ n=l j 


k=O(mod 4) (A,k)=1 


Use is now made of a result due to Estermann [2]. Let $\vartheta'$ and $\vartheta''$ be defined by (6), and let

$$g(N, \varphi, h, k) = \begin{cases} 1, & -\vartheta' \le \varphi \le \vartheta'', \\ 0, & \text{otherwise}. \end{cases}$$

Then

$$g = \sum_{r=1}^{k} b_r \exp\{2\pi i r h'/k\},$$

where $h'$ is an integer satisfying $hh' \equiv -1 \pmod{k}$, and $b_r$ is independent of $h$ and

$$\sum_{r=1}^{k} |b_r| < \log 4k.$$


Introducing the function $g(N, \varphi, h, k)$ into the integral $I_3$ gives


N » k+l) _ h’ 
I= 7 > | > 6, exp (anit) 


k=l n=l —1/k(N+1) ‘r= 
k= 0(mod 4) 
2an , 2ni, 
- exp | 2xmw —=—]} >> exp }——(nh — mh) (d¢. 
k a Amodk k 


The latter sum is a Kloosterman sum [4; 1] and has the estimate $O(k^{2/3+\epsilon}m^{1/3})$. Also, the real part of $2\pi n/k^2w$ is








2an 2anN* 2an 
Ray? a 4 2 27-2 Bay? "2 
k*(N* —ig)/ k*(N*+ 9°)” R°N* + R'N*G 
2an 
> i+i mn, 
and 
Theref R(2emw) = 2xmN~*. 
ereiore 
N eS 1/k(N+1) ok 
|Z3| = >» > ae| > \b,| exp (2amN~*)k***m' do 
= .F © n=1 —1/k(N+1) rel 
N 1/k(N+1) 
= > mp2 log 4x dé 
7 e —1/k(N+1) 
~ 1/3,2/3+e 1 
= / / es 
" he in) 
k= 0(mod 4) 
and so 
of 1 . 1/t+e, 1/3 
I| = ANT eM *) 
|Is| > m 


a O(n n “), 


5. Evaluation of the integral $I_2$.


N k—1 
I, = exp (2xmN*) > = exp (- 2nimt) 


km2(mod 4) (h,k)=1 


Se kewl +9) a0 
” f{ exp y2 a k + E exp ( — 2rim¢)d¢. 


Now, by Theorem 1, with $k \equiv 2 \pmod{4}$, and putting $q = e^{\pi i\tau}$ and $q' = e^{\pi iT}$,

$$f(q) = \lambda(\tau) = \frac{1}{\lambda(T)} = \frac{1}{16q'} + \sum_{n=0}^{\infty} b_n q'^n = \frac{1}{16q'} + f_2(q').$$

Therefore


N k—1 
I, = exp (2emN~) p> p exp ( —- anim?) 


el h=—0 
aad{med 4) @&bD=1 


$”" ’ . 
é [" n\ exp \2 mt + i\t) exp ( — 2rim¢)d@ 


_ Io1 + Ts,2, 


where $f_1(q')$ is replaced by $1/16q'$ in $I_{2,1}$ and by $f_2(q')$ in $I_{2,2}$. Introducing the function $g(N, \varphi, h, k)$ into the integral $I_{2,2}$ and proceeding as in §4 gives


N @ 1/k(N+1) k Pn 
[Z2,2| = of ym y be | > |d,-| exp (2amN~*)k”* mas) 


k=l n=0 —1/k(N+1) rel 
k= 2(mod 4) 


_ (4 > y+") 
Ne 


O(N~“**m as 


Next, 
N k—1 h 
Io. = exp (2xmN~) > } exp [. 2ximt) 
t= h=0 k 


k=? (mod 4) (A,k)=1 


¢”’ 1 : h 1 ° 
| 16 °xP t. 2niy i + i) exp (— 2nim¢)d¢ 
—~’ 


k=2 (mod 4) (A, k)=—1 
- 2a 

| exp (2 mw + 22a 
—?’ k Ww 


km2 (mod 4) (4,k)—1 


eh 19’ ( as) 
. na exp | 2xmw + Bo dw 


ll 
als 
Maz: 
MI 
@ 
* 
0 
| 
SS) 
~|§ 
Pe 
3 
aa 
~ 
—_—— 
~—— 


Now, 













o nal a < ei 
R(kit+k) ~kR(N +1) 
and 
“7 1 1 
¢ “ie +h SRF 1)’ 
and so 
1 N k—-1 if ; 
lni=—-Z > 5 exp (— 22m n+) 
146 n=O 
km? (mod 4) (h,k)=1 
'N "+ i/k(N+1) 
4" exp (tema + 2 + ae), — if ro 
N*+ 19’ 
—N*-+-4/k(N+1) —N *—i/k(N+1) N~ *—4/k(N+1) 
> efresag } eetpmene 2 29 
N~*+4/k(+1) —N”~ "+ i/k(N+1) —N~*—i/k(N+1) 
+ Foo ome + 
w —— }dw 
N~*—i/k(N+1) P k*w 
(7) oS Pagel” on (em + 3s), 
8 i *P Bw 
k=2 (mod 4) 
+Ki+K.+K;+ Kit Ks, 
where 
A,(m) = ; exp (- oot teh + i’). 
h mod & & 
Now 


‘ ‘(i *+-4/k(N+1) ( oe.) 
K,= 16 - A,(m)| _. exp | 2amw + Bo dw. 


N +t 
k=2 (mod 4) 


Introducing the function g(N,¢,h,k), and integrating from N~*+ 1/k(N + k) 
to N-*+- i/k(N + 1) gives 





|\K,| = > Rn 8 log 4k i) = O(N-***m"*), 
Similarly 
\Ks| -_ O(N -***m"*), 

In Ko, 

w=uti/k(N+1), —N°<u<N”, 

1 u ar-222 2 2 

R - = 27 ews <% R(N +1) <k, 

so that 





exp (ema + 2=))| < exp (2amN~ + 2n). 
Therefore 







N 
\K3| an > my") - O(N ***), 
Similarly 
\K,| = O(N -Y**%m"*), 
Again, in K3, 
, 1 
oan «aw | 2 ’ — 
o= N+ w, ENED <v<z RIN + 1) 
Also 
R (w) = —-N’*<0 
— N” 
we(2 1) _ ae( — +- ) ™ ant 2 <9, 
N‘+w N +9 
and hence 


| exp (ema + ey <=. 


Therefore 


IK;| ie > p/** ny, nt) inn O(N"), 


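Collecting these results together and substituting back into (7) gives

$$I_{2,1} = \frac{\pi}{8} \sum_{\substack{k=1 \\ k \equiv 2 \ (\mathrm{mod}\ 4)}}^{N} A_k(m)L_k(m) + O(N^{-1/3+\epsilon}m^{1/3}),$$

where [6; 3]

$$L_k(m) = \frac{1}{2\pi i} \int_{-\infty}^{(0+)} \exp\left(2\pi mw + \frac{2\pi}{k^2w}\right) dw = \frac{1}{k\sqrt{m}}\, I_1\!\left(\frac{4\pi\sqrt{m}}{k}\right),$$

$I_1(z)$ being the Bessel function of the first order with purely imaginary argument. Therefore

$$I_2 = \frac{\pi}{8\sqrt{m}} \sum_{\substack{k=1 \\ k \equiv 2 \ (\mathrm{mod}\ 4)}}^{N} \frac{A_k(m)}{k}\, I_1\!\left(\frac{4\pi\sqrt{m}}{k}\right) + O(N^{-1/3+\epsilon}m^{1/3}).$$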


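6. Evaluation of the integral $I_1$. In $I_1$, consider

$$\tau = \frac{h}{k} + \frac{iz}{k}, \qquad T = \frac{h'}{k} + \frac{i}{kz},$$

so that

$$f\left(\exp\left\{2\pi i\left(\frac{h}{k} + \frac{iz}{k}\right)\right\}\right) = f(\exp\{2\pi i\tau\}) = \lambda(2\tau).$$

Then, by Theorem 2, with $t = \exp \pi iT$,

$$\lambda(2\tau) = 1 + 16it^{1/2} - 128t + \ldots$$

when $h' \equiv 1 \pmod{2}$, and

$$\lambda(2\tau) = 1 - 16t^{1/2} + 128t - \ldots$$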











when $h' \equiv 0 \pmod{2}$. These may be combined by replacing $t$ by $t' = \exp \pi i(T + h') = t\exp(\pi ih')$, giving

A(2r) = 1+ 16¢4 + 1287 +... 


> Unt *. 


n=0 


Applying the transformations of Theorem 2 to the integrand of $I_1$ gives


N k—1 
exp (2xmN~*) >> 7 exp( - 2zim?) 
k=l h=0 


k=l (mod 2) (A, k)=1 


¢’’ . , . . ’ 
5 , > Un EXP rads + A exp (si ) exp (— 2xim¢)d¢ 
’ n=O 
co 1/k(N+1) + h 
> > (- iy > b, exp ( 2nir*) 


k=l =O 2 —1/k(N+1) rl 
k= 1 (mod 2) 


I; 


k—-1 ) 

™m 

. _— 2X h — 4mh) (do. 

exp (2m ca » exp £x nh mh) ¢d@ 
(A. k)=1 


Now the latter sum in the integrand is an incomplete Kloosterman sum for which we have [2; 4] the estimate

$$O(k^{2/3+\epsilon}(4m, k)^{1/3}) = O(k^{2/3+\epsilon}m^{1/3}).$$


w(*) - anN — oF 
2k*w) ~ 2(N* + RNG?) ~ 4 


f1/k(N+1) 
> hm a * lua le “exp 2amN~* 46) 


k=l J —1/k(N+1) 
1 —ls3+e« 1/3 
Mh k m 
4¥ k=l 
= O(N~”***m*), 
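7. The convergent series for $a_m$. Collecting together the results of §§4, 5 and 6, we have

$$a_m = I_1 + I_2 + I_3 = \frac{\pi}{8\sqrt{m}} \sum_{\substack{k=1 \\ k \equiv 2 \ (\mathrm{mod}\ 4)}}^{N} \frac{A_k(m)}{k}\, I_1\!\left(\frac{4\pi\sqrt{m}}{k}\right) + O(N^{-1/3+\epsilon}m^{1/3}).$$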


Also 





Therefore 


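Finally, letting $N \to \infty$, we get

(4)  $\displaystyle a_m = \frac{\pi}{8\sqrt{m}} \sum_{\substack{k=1 \\ k \equiv 2 \ (\mathrm{mod}\ 4)}}^{\infty} \frac{A_k(m)}{k}\, I_1\!\left(\frac{4\pi\sqrt{m}}{k}\right)$.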





As a numerical example we may compare the actual value of $a_{16}$ with the value obtained from the series (4). Thus $a_{16} = -316342272$. Using the series for $a_{16}$, we have

$$a_{16} = \frac{\pi}{32} \sum_{\substack{k=1 \\ k \equiv 2 \ (\mathrm{mod}\ 4)}}^{\infty} \frac{A_k(16)}{k}\, I_1\!\left(\frac{16\pi}{k}\right),$$

$$\frac{\pi}{64} A_2(16)\, I_1(8\pi) = -316342253.1678,$$

$$\frac{\pi}{192} A_6(16)\, I_1\!\left(\frac{8\pi}{3}\right) = -18.6991,$$

$$\frac{\pi}{320} A_{10}(16)\, I_1\!\left(\frac{8\pi}{5}\right) = -0.0935.$$
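The numerical example can be reproduced approximately with a few lines of code. The sketch below evaluates $I_1$ from its power series and $A_k(m)$ from the definition in §1; the helper names are hypothetical, and the three-argument `pow(h, -1, k)` used for the modular inverse requires Python 3.8 or later.

```python
import cmath
import math

def bessel_i1(x, terms=200):
    """Modified Bessel function I_1(x) = sum_j (x/2)^(2j+1) / (j! (j+1)!)."""
    total, term = 0.0, x / 2.0
    for j in range(terms):
        total += term
        term *= (x / 2.0) ** 2 / ((j + 1) * (j + 2))
    return total

def A(k, m):
    """A_k(m): sum over h mod k with (h,k)=1 of exp(-(2*pi*i/k)(m*h + h')), h*h' = -1 (mod k)."""
    total = 0
    for h in range(k):
        if math.gcd(h, k) == 1:
            h_prime = (-pow(h, -1, k)) % k
            total += cmath.exp(-2j * math.pi * (m * h + h_prime) / k)
    return total.real          # the sum is real; any imaginary part is rounding error

def series_term(k, m):
    """The k-th term of series (4)."""
    x = 4 * math.pi * math.sqrt(m) / k
    return math.pi / (8 * math.sqrt(m)) * A(k, m) / k * bessel_i1(x)

def partial_sum(m, K):
    """Sum of the terms of (4) with k <= K, k = 2 (mod 4)."""
    return sum(series_term(k, m) for k in range(2, K + 1, 4))

print(series_term(2, 16), series_term(6, 16), series_term(10, 16))
print(partial_sum(16, 200))
```

The partial sums approach the integer $a_{16}$ only slowly, in keeping with the error term $O(N^{-1/3+\epsilon}m^{1/3})$.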


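8. The reciprocal function $\mu(\tau)$.

THEOREM 3. Let

$$\mu(\tau) = g(q) = \frac{1}{\lambda(\tau)} = \left[\frac{\theta_3(0|\tau)}{\theta_2(0|\tau)}\right]^4 = \frac{1}{16q}(1 + 8q + \ldots) = \frac{1}{16q} + \sum_{n=0}^{\infty} b_n q^n.$$

Then, for $m > 0$,

(5)  $\displaystyle b_m = \frac{\pi}{8\sqrt{m}} \sum_{\substack{k=1 \\ k \equiv 0 \ (\mathrm{mod}\ 4)}}^{\infty} \frac{A_k(m)}{k}\, I_1\!\left(\frac{4\pi\sqrt{m}}{k}\right)$.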
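Proof. Since the analysis in this case is essentially the same as for $\lambda(\tau)$, we will only outline the proof. The transformation equations for $\mu(\tau)$ may be obtained directly from those for $\lambda(\tau)$. Now, by Cauchy's theorem, for $m > 0$,

$$b_m = \frac{1}{2\pi i} \int_C \frac{g(q)}{q^{m+1}}\, dq,$$

where, as before, $C$ is the circle of radius $|q| = \exp(-2\pi N^{-2})$. Therefore

$$b_m = \exp(2\pi mN^{-2}) \sum_{k=1}^{N} \sum_{\substack{h=0 \\ (h,k)=1}}^{k-1} \exp\left(-2\pi i m\frac{h}{k}\right) \int_{-\vartheta'}^{\vartheta''} g\left(\exp\left\{2\pi i\left(\frac{h}{k} + \frac{iz}{k}\right)\right\}\right) \exp(-2\pi i m\varphi)\, d\varphi.$$

Let $b_m = b_{m,1} + b_{m,2} + b_{m,3}$, where $b_{m,1}$ consists of the terms of $b_m$ for which $k \equiv 1 \pmod{2}$, $b_{m,2}$ those for which $k \equiv 2 \pmod{4}$, and $b_{m,3}$ those for which $k \equiv 0 \pmod{4}$. Then it may be shown that

$$b_{m,1} = O(N^{-1/3+\epsilon}m^{1/3}), \qquad b_{m,2} = O(N^{-1/3+\epsilon}m^{1/3})$$

and

$$b_{m,3} = \frac{\pi}{8\sqrt{m}} \sum_{\substack{k=1 \\ k \equiv 0 \ (\mathrm{mod}\ 4)}}^{N} \frac{A_k(m)}{k}\, I_1\!\left(\frac{4\pi\sqrt{m}}{k}\right) + O(N^{-1/3+\epsilon}m^{1/3}).$$

Then letting $N \to \infty$ we get equation (5).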


Similar results may be obtained for the Fourier coefficients of powers of $\lambda(\tau)$ and $\mu(\tau)$. However, these are omitted here since the method used in obtaining them is merely a repetition of that given for $\lambda(\tau)$.


REFERENCES 


1. H. Davenport, On certain exponential sums, J. Reine Angew. Math., Bd. 169 (1933), 158-176.

2. T. Estermann, Vereinfachter Beweis eines Satz von Kloosterman, Abhandlungen aus dem Mathematischen Seminar der Hamburgischen Universität, Bd. 7 (1939), 82-98, especially 94.

3. H. Rademacher, The Fourier coefficients of the modular invariant J(τ), Amer. J. Math., vol. 60 (1938), 501-512.

4. H. Salié, Zur Abschätzung der Fourierkoeffizienten ganzer Modulformen, Math. Z., Bd. 36 (1933), 263-278.

5. J. Tannery and J. Molk, Théorie des fonctions elliptiques, Tome II (Paris, 1896), 290.

6. G. N. Watson, Theory of Bessel functions (Cambridge, 1922), 181.


The University of British Columbia 




















AXIOMS FOR ELLIPTIC GEOMETRY 


DAVID GANS 


Introduction. Until recently the literature contained little on the axiomatic 
foundations of elliptic geometry that was non-analytical and independent of 
projective geometry. During the past decade this subject has come in for further 
study, notably by Busemann [2] and Blumenthal [1], who supplied such foun- 
dations. This paper presents another and, it is believed, simpler effort in the 
same general direction, proceeding by the familiar synthetic methods of ele- 
mentary geometry and using only elementary topological notions and ideas 
concerning metric spaces. Specifically, elliptic 2-space is obtained on the basis 
of six axioms, most notable of which is one assuming the existence of translations. 
The writer wishes to express his deep appreciation to Herbert Busemann for 
his invaluable help. 


I. SOME BASIC TERMS AND NOTATIONS

Small letters always denote points. The distance between two points a,b of any metric space is denoted by ab or ba. A point c is said to be between points a,b (denoted by acb or bca) if c ≠ a, b and ac + cb = ab, and is said to be a midpoint of a and b (denoted by c = mid(a,b)) if, moreover, ac = cb. An arc, a simple arc, and a geodesic arc mean, respectively, a continuous, a topological, and a congruent map of a closed Euclidean segment; a simple closed curve means the homeomorph of a Euclidean circle. An arc or geodesic arc with endpoints a,b is said to lie between a and b, and is denoted by (ab) or [ab], respectively. When no confusion can arise "geodesic arc" is often shortened to "arc", as in the phrase "the arc [ab]". In the interest of clarity of presentation and ease of reference, as well as to offer brief proofs, the number of theorems used has been large. Each theorem, when first stated, is denoted merely by an Arabic numeral, the word "Theorem" being omitted. Some proofs are not given.

II. GEODESIC ARCS AND STRAIGHT LINES

AXIOM 1. Σ is a compact metric space with at least two points.

Σ is then bounded, and the function xy, where x and y are arbitrary points of Σ, has a maximum. We take this maximum as unit distance, calling points a and b conjugate if ab = 1.


Axiom 2. Any distinct points a,b have just one midpoint if non-conjugate, 
just two if conjugate. 


1. If abc, then a and b have a unique midpoint, and likewise for b and c.





Received September 23, 1950; in revised form September 17, 1951. 


Σ is convex by Axiom 2, i.e., there is a point between each two points. From Menger [4] we then infer Theorems 2 to 4; Theorem 5 is immediate; Theorem 6 follows from Theorem 1, Menger [4], and Axiom 1.

2. There exists a geodesic arc between any two distinct points. 


3. An arc (ab) is a geodesic arc if and only if its length equals ab, or if it is the shortest arc between a and b.

4. The geodesic arcs [ab] are distinguished among all the arcs (ab) by the property that if p,q are inner points of [ab], then apq or qpb or p = q.

5. If p is an inner point of [ab], then apb.

6. There is just one geodesic arc between two non-conjugate points, just two between two conjugate points.

7. If ab = 1 the two geodesic arcs [ab] have only a and b in common.

8. If p,q are inner points of [ab], and apq, then pqb.

9. If abc, then there is only one arc [ab] and one arc [bc], [ab] + [bc] is an arc [ac], and b is an inner point of an arc [ac].


AXIOM 3a. If abc and abd, then either c = d or bcd or bdc.
b. If abc and bcd, where ab + bc + cd < 1, then abd.

AXIOM 4. If c is a midpoint of a and b, then a point d exists such that cd = 1, cad, and cbd.


10. If abc and abd, then either c = d; or acd, bcd; or adc, bdc.

Proof. Menger showed that abc, abd, acd imply bcd [4, p. 107]. Similarly one can easily show that abc, abd, bcd imply acd. Now assume abc and abd. Then either c = d or bcd or bdc by Axiom 3a. If bcd, then acd by the proposition stated two sentences back. If bdc, then adc, as can be seen by interchanging c and d in the proposition stated in the first sentence.


11. The point d described in Axiom 4 is unique. 


Proof. Assume d′ is another point having the same properties as d. Then
cd′ = 1, cad′, and cbd′. Since cad and cad′ we infer by Theorem 10 that either
d = d′; or cdd′, add′; or cd′d, ad′d. Now cdd′ means that cd + dd′ = cd′,
where c,d,d′ are all distinct, and this is impossible since cd = cd′ = 1. Likewise
cd′d is impossible. Hence d = d′.

DEFINITION 1. Let a,b be any distinct points, c = mid(a,b), and d the unique
point such that cd = 1, cad, and cbd. The point-set consisting of c, d, and all
points between c and d is called a straight line (or line) determined by a and b.

12. A unique straight line is determined by any two distinct points a and b, this
straight line being denoted by -ab-. Every straight line is a simple closed curve of
length 2.





AXIOMS FOR ELLIPTIC GEOMETRY 83 


Proof. If ab < 1, a and b have a unique midpoint c and hence determine a
unique line. If d is the point such that cd = 1, cad, and cbd, it follows from
Theorems 7 and 9 that this line is the simple closed curve of length 2 formed by
the two arcs [cd]. If ab = 1, let c′ = mid(a,b) and d′ be the point such that
c′d′ = 1, c′ad′, and c′bd′. As above, a and b determine a line consisting of the
two arcs [c′d′]. Now ad′ = ½ since ac′ = ½, c′d′ = 1, and c′ad′. Likewise
bd′ = ½. Hence

ad′ + d′b = 1 = ab,

so that d′ = mid(a,b). By Theorem 9, [ad′] + [d′b] is a geodesic arc between
a and b, obviously not the geodesic arc between a and b which contains c′.
Hence the line under discussion consists of the two arcs [ab]. If now we let c″
be the second midpoint of a,b and d″ the point such that c″d″ = 1, c″ad″, and
c″bd″, then a and b will determine a line formed by the two arcs [c″d″]. But,
as above, this line also consists of the two arcs [ab], and it is clear that c″ = d′
and c′ = d″. Hence a and b determine a unique line, which is again a simple
closed curve of length 2.


13. Every straight line is congruent to a Euclidean circle of length 2. 


Proof. Let ab = 1. Then, as shown in the proof of Theorem 12, -ab-
consists of the two arcs [cd], where c = mid(a,b) and d is the unique point such
that cd = 1, cad, and cbd. We take a Euclidean circle K of length 2, map a,b
on any two antipodal points A,B of K, and c,d on C,D, the midpoints of A,B,
then map geodesic arcs [cad], [cbd] congruently on semicircles CAD, CBD,
respectively. Now if p,q be any distinct points of -ab-, and P,Q their corresponding
points of K, we must show that pq = PQ, where PQ denotes the length of the
shorter of the two arcs into which P and Q divide K. Since a,b,c,d divide -ab-
into four equal quadrants, with a,b the midpoints of c,d, and vice versa, it is easy
to see that pq = PQ if p and q are in the same quadrant or in adjacent quadrants.
But suppose p,q are interior points of opposite quadrants, e.g., let apc and bqd.
Then

(pc + cb + bq) + (pa + ad + dq) = 2,

so that at least one expression in parentheses, say the first, does not exceed 1.
Then pc + cb + bq ≤ 1. Also pcb and cbq. Hence pcq by Axiom 3b, so that
pc + cq = pq ≤ 1. Since PC = pc, CQ = cq we see that PC + CQ ≤ 1.
Hence PQ = PC + CQ = pq.

Now let ab < 1. Again -ab- consists of the two arcs [cd], as above, and we
map geodesic arcs [cad], [cbd] congruently on semicircles CAD, CBD,
respectively. Let e,f be the midpoints of c,d, chosen so that eac and fbc. Thus we
have eac, acb, and ea + ac + cb < 1, from which we infer eab by Axiom 3b.
Then eac, eab, acb permit us to infer ecb by Theorem 10. Now ecb, cbf, and
ec + cb + bf = 1 imply ecf by Axiom 3b, and from this we infer ebf by Theorem
10. Hence ef = ec + cf = 1, and we infer by Theorem 3 that ecf is a geodesic
arc between e and f. Also, since ed + df = 1 = ef we infer that (edf) is a geodesic
arc between e and f. Thus c and d are the midpoints of e and f, as well as vice
versa, so that the proof of the congruence of -ab- and K is just like that in the
case ab = 1 except that now we use e and f instead of a and b.


14. If -ab- is any straight line and p,q are any distinct points of -ab-, then each
arc [pq] is contained in -ab-.


Proof. It follows from previous discussions that each arc on -ab- of length
≤ 1 is a geodesic arc. There are two arcs (pq) on -ab-. If they are of equal
length, each has length 1 and hence is a geodesic arc; in this case pq = 1 and
-ab- consists of the two arcs [pq]. If the two arcs (pq) are unequal in length, only
the shorter is a geodesic arc since its length is less than 1; in this case pq < 1,
there is just one arc [pq], and -ab- contains it.


15. If -ab- is any straight line and p,q are any distinct points of -ab-, then 
-pq- = -ab-. 


Proof. Let pq = 1. Then, as shown in the proof of Theorem 12, -pq-
consists of the two arcs [pq]. In the proof of Theorem 14 we saw that -ab- also
consists of these two arcs. Hence -ab- = -pq-. Now let pq < 1. Then
[pq] ⊂ -ab- by Theorem 14, so that also r ⊂ -ab-, where r = mid(p,q). Let d
be the unique point such that rd = 1, rpd, and rqd. Then, as shown in the proof
of Theorem 12, -pq- consists of the two arcs [rd]. Now let s be the point of -ab-
antipodal to r. Then rs = 1, rps, and rqs. It follows from Theorem 11 that
d = s. Hence -pq- consists of the two arcs [rs]. But, by the proof of Theorem
14, -ab- also consists of these two arcs. Hence -ab- = -pq-.


Combining Theorems 12 and 15 we get: 


16. Any two distinct points are on a unique straight line. 


III. OUR SPACE Σ AND THE S. L. SPACES OF BUSEMANN


An S. L. space (straight line space) is defined as one satisfying the following 
five axioms [2]: 

A. It is metric. 

B. It is finitely compact. 

C. It is convex. 

D. Each point p has a neighborhood N: xp < ρ, ρ > 0, such that for any
distinct points a,b of N and each ε > 0 there is a positive δ(a,b,ε) < ε for which a
unique point b_δ exists such that bb_δ = δ and abb_δ.

E. Any two distinct points are on, at most, one geodesic (a geodesic is a 
locally congruent map of the real axis, and hence is not a geodesic arc). 


Clearly Σ has properties A, B, C. To show it has property D, let p be any
point and let ρ < ½. If a,b are distinct points in this ρ-neighborhood of p, then
ab ≤ ap + pb < 1. Let a,c divide -ab- into equal geodesic arcs. Then b divides
one of these arcs into arcs [ab] and [bc], and also abc. Let x be a point such that
abx. Then abc and abx, so that c = x; or acx, bcx; or axc, bxc by Theorem 10.
Since ac = 1, acx is impossible, so that x is on the unique arc [bc] by Theorem 9.
For every positive δ less than bc and ε there is, of course, a unique point x on
[bc] such that bx = δ. Thus Σ has property D. To show that Σ has property
E we first note that any line -ab- is a geodesic since each point of -ab- has a
neighborhood on -ab- which is a geodesic arc. Conversely, if G is any geodesic
in Σ it contains two distinct points a,b such that an arc [ab] is contained in G.
Now a,b determine -ab-, which also contains this arc [ab]. Thus -ab-, a geodesic,
and G, also a geodesic, both contain [ab]. But in a space with properties A, B,
C, D a unique geodesic contains a given geodesic arc [2, p. 21]. Hence G = -ab-.


It then follows by Theorem 16 that Σ has property E. We have thus proved:

17. Σ is an S. L. space, its straight lines and geodesics being identical.

Wishing to confine ourselves to plane geometry we assume:

Axiom 5. Σ is two-dimensional in the sense of Menger-Urysohn.


Since Σ is a two-dimensional S. L. space whose geodesics are all simple closed
curves, we can infer the following [2, pp. 79, 81]:


18. Σ is a projective plane and each two of its straight lines meet in a unique
point.


19. If p and L are any point and line of Σ, respectively, where p ⊄ L, and S
is the set of all points on the lines joining p to each point of L, then S = Σ.


IV. MOTIONS AND TRANSLATIONS 


DEFINITION 2. A motion M is a single-valued, distance-preserving
transformation of Σ into itself. M(α,β) = α′,β′ means that M sends subsets α,β into
subsets α′,β′, respectively. We say α is fixed under M if α′ = α. A sequence of
motions Mₙ converges to a motion M if Mₙ(x) → M(x) as n → ∞ for each point
x of Σ. (The existence of motions other than the identity is assumed later.)


20. Motions are topological transformations, the set of all motions forming a 
group. 


21. Any infinite sequence of motions has a convergent subsequence. 


Proof. If, for an infinite sequence of motions Mₙ of a finitely compact
metric space, a point b exists for which the set {Mₙ(b)} is bounded, then Mₙ
contains a convergent subsequence [2, p. 177]. Σ is finitely compact and
bounded. Hence {Mₙ(b)}, b being arbitrary, is bounded. The theorem then
follows.


22. Each motion sends between-points into between-points, midpoints into
midpoints, conjugate points into conjugate points, geodesic arcs into geodesic arcs, and
straight lines into straight lines.













To arrive at our definition of a translation let us suppose that a motion M has a
fixed line L (the existence of motions with fixed lines is formally assumed later).
If [ab] ⊂ L and M(a,b) = a′,b′, then M([ab]) = [a′b′] ⊂ L. Since L is
congruent to a Euclidean circle and M preserves distance on L, it follows that if the
oriented geodesic arcs [ab], [a′b′] have the same sense so will each oriented
geodesic arc [xy] of L and its transform [x′y′] = M([xy]) have the same sense, whereas
if [ab], [a′b′] have unlike senses so will [xy], [x′y′] have unlike senses. Thus
M is either sense-preserving or sense-reversing on L.


DEFINITION 3. A translation (of Σ) along a line is a motion of Σ leaving that
line fixed and preserving sense on it. (The existence of translations is assumed
later.)


23. The set of all translations along the same line forms a group. 


24. Each infinite sequence of translations along the same line has a subsequence 
converging to a translation along that line. 


Proof. If L is the line, each infinite sequence of translations along L has a
subsequence Tₙ converging to a motion T by Theorem 21. For any point p of
L let T(p) = p′, Tₙ(p) = pₙ. Then pₙ ⊂ L, and pₙ → p′ when n → ∞. Line L
being a closed set, p′ ⊂ L, that is, T(L) = L. If q ⊂ L, where 0 < pq < 1, let
T(q) = q′, Tₙ(q) = qₙ. Then qₙ → q′. Since [pq], [pₙqₙ] have the same
sense, [p′q′] has this same sense. T must then preserve sense for all geodesic
arcs of L, and hence be a translation along L.


25. A translation along a line leaving a point of that line fixed leaves each point 
of the line fixed. 


26. A translation leaving fixed each of two non-conjugate points leaves fixed each 
point of their line. 


DEFINITION 4. Distinct translations S, T along the same line are called
equivalent along the line if S(x) = T(x) for each point x of the line.


27. Distinct translations S, T along the same line are equivalent along the line if a 
point p exists on the line so that S(p) = T(p). 


Proof. Let L be the line, x any point of it, S(p,x) = q,x′, and T(x) = x″.
Then TS⁻¹(q,x′) = q,x″, the translation T being applied second. TS⁻¹ leaves
each point of L fixed by Theorems 23, 25. Hence x′ = x″.


28. Every motion (and hence every translation) has at least one fixed point. 


Proof. A motion being a continuous mapping and Σ being a projective
plane, the assertion follows from the fact that a continuous mapping of a
projective plane into itself has a fixed point [3, p. 80].


29. A translation along a line having no fixed point on that line has one fixed 
point all told. 







Proof. Let T and L be the translation and line, respectively. A point a
exists so that T(a) = a. Let b ≠ a, T(b) = b. Then T(-ab-, L) = -ab-, L by
Theorem 22. Let -ab-, L meet in c. Then T(c) = c, which contradicts the
hypothesis.


Axiom 6. Distinct lines G, H exist, each with the property that if a, b are any 
points on it (not necessarily distinct), there are exactly two distinct translations 
along it sending a into b. 


30. There is just one translation along G other than the identity leaving each 
point of G fixed. 

This translation is denoted by R, the identity by I. (A corresponding
assertion, of course, holds for H. For brevity we shall usually state things only
in terms of G.)


31. If S, T are equivalent translations along G, then S² = T² and ST = TS.
Furthermore, TS⁻¹ = S⁻¹T = R.

Proof. Let S(p) = T(p) = q, where p ⊂ G. Then T⁻¹S(p) = p. Hence
T⁻¹S, a translation along G, leaves each point of G fixed. Suppose T⁻¹S = I.
Then

T(T⁻¹S) = TI = T,

so that (TT⁻¹)S = T, or IS = T, and finally S = T, which is a contradiction.
Hence T⁻¹S = R. Likewise

TS⁻¹ = S⁻¹T = ST⁻¹ = R.

From ST⁻¹ = S⁻¹T and ST⁻¹ = T⁻¹S, respectively, we get S² = T² and
ST = TS.
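The last step of this proof is left to the reader in the text. One way to spell it out, multiplying each relation on the left and on the right by suitable translations, is sketched here (the choice of multipliers is ours, not the author's):

```latex
\begin{align*}
ST^{-1} = S^{-1}T
  &\;\Longrightarrow\; S\,(ST^{-1})\,T = S\,(S^{-1}T)\,T
  \;\Longrightarrow\; S^{2} = T^{2},\\
ST^{-1} = T^{-1}S
  &\;\Longrightarrow\; T\,(ST^{-1})\,T = T\,(T^{-1}S)\,T
  \;\Longrightarrow\; TS = ST .
\end{align*}
```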

32. If S, T are equivalent translations along G with no fixed point on G, they have a
common fixed point, but no other fixed point.

Proof. S and T have unique fixed points by Theorem 29, which we denote
by f and g, respectively. Suppose f ≠ g. Let S(g) = g′, T(f) = f′. Then
g ≠ g′, f ≠ f′. Also

S²(f) = T²(f),

or T(f′) = f. Likewise S(g′) = g. By Theorem 31, ST(f) = TS(f), or S(f′) = f′,
which contradicts the fact that S has f as its only fixed point. Hence f = g.


33. All translations along G have a common fixed point, to be denoted by g. 


Proof. Let a, a₁ be points of G with aa₁ = ½, and T₁ a translation along G
such that T₁(a,g) = a₁,g, where g is the fixed point of T₁. Let a₂ = mid(a,a₁)
and T₂(a) = a₂; a₃ = mid(a,a₂) and T₃(a) = a₃; and in general aₙ = mid(a,aₙ₋₁)
and Tₙ(a) = aₙ, where n > 1. Then Tₙ(g) = g for all positive integers n. For
any point x, where axa₁, we can construct a translation S from the translations
Tₙ and their limiting translations such that S(a,g) = x,g. The powers of all
such translations S send a into all the points of G. Now the totality of
translations sending a into all the points of G is identical with the set of all translations
along G. Hence if y, z are any points of G, at least one of the two translations
along G sending y into z leaves g fixed. Call this translation U, and let V be the
equivalent translation along G. If y ≠ z then, by Theorem 25, U and V have
no fixed point on G; from Theorem 32 and the fact that U(g) = g we then infer
that V(g) = g. If y = z we note that R and I are the only translations along G
sending y into z, and that I(g) = g. To show that R(g) = g let T₀, T₁ be any
pair of translations equivalent along G, but with no fixed point on G. Then
T₁⁻¹T₀ = R by Theorem 31, and T₀(g) = T₁(g) = g, as just shown above.
From this we see that R(g) = g.


34. Each point of G is conjugate to g. 


Proof. gx is constant for any point x on G by Theorem 33 and Axiom 6.
Assume gx < 1. Now R(x,g) = x,g. It follows from Theorem 26 that R leaves
fixed each point of -gx-, and hence, by Theorem 19, each point of Σ, so that
R = I. From this contradiction we infer gx = 1.


35. The translation R has no fixed points other than g and each point of G.

We let h denote the common fixed point for all translations along H.


36. The points g and h are distinct, and g is on H if, and only if, h is on G.


V. ROTATIONS, POLES AND POLARS, AND REFLECTIONS 


DEFINITION 5. A motion leaving a point c fixed is called a rotation about c. If
for all points x, y such that xc = yc a rotation about c exists sending x into y,
we say that all rotations about c exist.


37. All rotations about g and h exist. 


Proof. Considering only g, let a, b be any points such that ga = gb. If a
is on G, so is b, in which case a translation along G, that is, a rotation about g,
exists sending a into b. Suppose a and b are not on G. Let -ga- ≠ -gb-, let -ga-,
-gb- meet G in a′, b′, respectively, and let S, T be the distinct translations along
G such that S(a′) = T(a′) = b′. Now each of these translations sends a into a
point of -gb′- whose distance from g equals gb. Let b, b″ be the two points of
-gb′- at distance gb from g. S and T cannot both send a into b″, for suppose
S(a) = T(a) = b″. Since S(a′) = T(a′) = b′, we have ST⁻¹(b′, b″) = b′, b″.
Hence ST⁻¹, a translation along G, has b′, b″, as well as g, as fixed points. By
Theorem 25, then, each point of G is fixed under ST⁻¹, and from Theorem 35 we
see that ST⁻¹ = I, and hence that S = T. Since this contradicts the fact
that S ≠ T, we infer that S, T cannot both send a into b″. Similarly they
cannot both send a into b. Hence either S(a) = b or T(a) = b. Finally, let
-ga- = -gb-. Then R(a) = b, for R leaves -ga- fixed, but not point a, by
Theorem 35.




38. If all rotations about a point p exist, and a motion exists sending p into a
point q (≠ p), then all rotations about q exist.


Proof. Let c,d be any points such that qc = qd, and M a motion such that
M(p) = q, M⁻¹(c,d) = a,b. Then pa = pb. If N be a rotation about p such
that N(a) = b, then MNM⁻¹(q,c) = q,d.

39. All rotations exist about some point of G.


Proof. If g ⊂ H the assertion is a consequence of Theorems 36 and 37.
If g ⊄ H, take point c (≠ g) on -gh- so that hc = hg. A rotation exists
about h sending g into c; hence all rotations about c exist. Take d (≠ h) on
-gh-, so that cd = ch. There is a rotation about c sending h into d; hence all
rotations about d exist. Thus we obtain a sequence of points c,d,e, . . . on -gh-
about each of which all rotations exist. The function hx, where x ranges over G,
attains its maximum and minimum at points of G since G is a closed set and Σ
is compact. Let G and H meet at p; then hp is such a maximum, with the value 1.
Let a be a point of G such that ha is a minimum of hx.

Case 1. gh ≥ ha. Then hp > gh ≥ ha since 1 > gh. Since G is connected
and closed, hx takes on all values between its maximum hp and minimum ha.
Hence for some point x = x′ we have hx′ = gh. A rotation about h exists
sending g into x′, so that all rotations exist about x′, a point of G.

Case 2. gh < ha. Let -gh- meet G in r, whence gh + hr = gr. Then ha ≤ hr,
so that gh < hr. The latter relation may be written

(1) hr = K·gh + F·gh,

where K is a positive integer and 0 ≤ F < 1. We then have

hr = K·gh + gh − (1 − F)gh,

or

(2) hr + (1 − F)gh = (K + 1)gh.

Since 1 − F > 0 we infer from (2) that

(3) hr < (K + 1)gh.

Add gh to each side of (1), obtaining

hr + gh = (K + 1)gh + F·gh

or

gr = (K + 1)gh + F·gh.

Since F·gh ≥ 0, we get

(4) (K + 1)gh ≤ gr.

From (3) and (4) we have

hr < (K + 1)gh ≤ gr.

Since ha ≤ hr we get

ha < (K + 1)gh ≤ gr.

Taking N = K + 1, and noting that gr = hp = 1, we obtain

ha < N·gh ≤ hp.

Now we take that one of the points c,d,e,... mentioned previously whose
distance from h equals N·gh, and denote the point by y. As shown in Case 1
there exists a point on G, which we denote by z, such that hy = hz. Hence a
rotation about h exists which sends y into z. Since all rotations exist about y
they likewise exist about z.
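The arithmetic of Case 2 is a division with remainder, and it can be traced on concrete numbers. The sketch below uses exact fractions; the function name `multiple_between` and the sample distances are hypothetical and serve only as an illustration, with gr = gh + hr normalized to 1 as in the proof.

```python
from fractions import Fraction
import math

def multiple_between(gh, hr):
    """Write hr = K*gh + F*gh with K an integer and 0 <= F < 1,
    and return (K, F, N) where N = K + 1."""
    ratio = hr / gh
    K = math.floor(ratio)      # integer part of hr/gh
    F = ratio - K              # fractional part, 0 <= F < 1
    return K, F, K + 1

# Hypothetical distances with gh < hr and gh + hr = gr = 1.
gh, hr = Fraction(3, 10), Fraction(7, 10)
gr = gh + hr
K, F, N = multiple_between(gh, hr)

# Boundary case F = 0, where N*gh equals gr exactly.
gh0, hr0 = Fraction(1, 4), Fraction(3, 4)
K0, F0, N0 = multiple_between(gh0, hr0)
```

With these values K = 2, F = 1/3 and N = 3, so that hr = 7/10 < N·gh = 9/10 ≤ gr = 1. In the boundary case F = 0 the multiple N·gh coincides with gr, which is why the right-hand inequality cannot be strict in general.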


40. All rotations exist about each point of G and H. 


DEFINITION 6. The locus of points conjugate to any point p is called the polar
of p, and p is called the pole of the locus.


41. G and H are the polars of g and h, respectively.


42. If a motion sends a point p into a point q, it sends the polar of p into the 
polar of q. 


43. The polar of each point of G or H is a straight line. 


Proof. Let x be any point of G, and r its conjugate on G. Since xg = xr = 1,
a rotation exists about x sending g into r. Some translation along G sends r
into x. Hence a motion exists sending g into x, and hence G into X, the polar
of x. Then X is a straight line by Theorem 22.


DEFINITION 7. A group of motions of a metric space into itself is transitive if 
for each pair of points a,b of the space there exists a motion of the group sending 
a into b. 


44. The group of all motions of Σ is transitive.


Proof. Let x,y be any distinct points, p any point of G, P the polar
of p, and q the intersection of G and P. Let x′ ⊂ P so that xg = x′g. A rotation
exists about g sending x into x′ by Theorem 37. Since px′ = pq = 1, a rotation
exists about p sending x′ into q by Theorem 40. Let y′ ⊂ G so that py = py′.
Some translation along G sends q into y′, and a rotation exists about p sending
y′ into y. The resultant of these four motions is a motion sending x into y.


45. All rotations exist about each point of Σ.

Proof. This follows directly from Theorems 37, 38, 44.

46. The polar of each point of Σ is a straight line.

47. Distinct points have distinct polars.


Proof. Let a,b be any distinct points, with polars A,B, respectively. If
b ⊂ A, clearly A ≠ B. If b ⊄ A, let -ab- meet A in c. Then ac = 1 and
abc. Thus ab + bc = ac, so that bc < ac. Since bc, the distance from b to a
point of A, is not 1 we infer that A ≠ B.




48. Each straight line of Σ is the polar of some point.


Proof. Let C be any line, a,b distinct points of C, and A,B the polars of
a,b, respectively. A,B meet in a point c. Then ca = cb = 1 since c is on A
and B. Hence the polar of c must contain a and b, and must therefore be -ab-.
Thus C is the polar of c.
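In the usual spherical model of the elliptic plane (an assumption of this sketch, not something the text relies on), the construction in this proof is the cross product: the polars of a and b are the great circles orthogonal to a and to b, and they meet in the direction a × b. The helper names below are illustrative.

```python
import math

def norm(v):
    """Scale a non-zero vector to unit length."""
    n = math.sqrt(sum(c * c for c in v))
    return tuple(c / n for c in v)

def dist(x, y):
    """Elliptic distance between unit vectors (x and -x identified),
    scaled so that the maximum distance is 1."""
    dot = abs(sum(p * q for p, q in zip(x, y)))
    return (2 / math.pi) * math.acos(min(1.0, dot))

def cross(a, b):
    """Ordinary vector product in three-space."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def pole_of_line(a, b):
    """Point c conjugate to both a and b, i.e. the pole of -ab-."""
    return norm(cross(a, b))

a = norm((1.0, 2.0, 2.0))
b = norm((2.0, 0.0, 1.0))
c = pole_of_line(a, b)
# Another point of -ab-: any non-trivial combination of a and b.
m = norm(tuple(p + 2 * q for p, q in zip(a, b)))
```

Here `dist(c, a)`, `dist(c, b)` and `dist(c, m)` all equal 1 up to rounding, so every point of -ab- is conjugate to c.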


DEFINITION 8. A k-dimensional linear subspace of an S. L. space is any closed
k-dimensional set (of the space) which, if it contains any two distinct points,
also contains the geodesic through them. An involutory motion is a motion, not
the identity, whose square is the identity. An involutory motion M of an S. L.
space is called a reflection in the linear subspace S when all points of S are fixed
under M, and S is maximal, i.e., it is not a proper subset of any other linear
subspace whose points are fixed under M [2, pp. 113, 179].


49. Each straight line is a 1-dimensional linear subspace of Σ.


50. If p is any point, and P its polar, one of the rotations about p is a reflection
in P.


Proof. Let p = g. Then R² = I² by Theorem 31, or R² = I. By Theorem
35 only g, apart from each point of G, is fixed under R. Thus R is a reflection
in G by Theorem 49. Now let p ≠ g, and M(p) = g, M being a motion. Then
M(P) = G. Let x be any point of P, y any point between x and p, and z (≠ y)
the point on -xp- such that py = pz. Let

M(x,y,z) = x′,y′,z′,

in which case M(-xp-) = -x′g- and y′,z′ are on -x′g-, with gy′ = gz′. Then

M⁻¹RM(p,x) = p,x.

Thus the motion M⁻¹RM leaves p, and also each point of P, fixed. But it leaves
no other point of Σ fixed. For, suppose M⁻¹RM(y) = y, where y is any point
of Σ not on P and distinct from p; then RM(y) = M(y), or R(y′) = y′, where,
as above, M(y) = y′. But this contradicts Theorem 35. Now M⁻¹RM ≠ I,
otherwise R = I. Also R(y′) = z′ since gy′ = gz′. Hence

M⁻¹RM(p,x,y,z) = p,x,z,y,

so that (M⁻¹RM)² = I. Also M⁻¹RM leaves fixed each point of the linear
subspace P, and P is maximal. Hence M⁻¹RM is a reflection in P, and since it
leaves p fixed it is also a rotation about p.


From Theorems 48 and 50 we then obtain: 
51. A reflection exists in each straight line of Σ.


DEFINITION 9. A metric space is called homogeneous if and only if it is
congruent to a Euclidean, hyperbolic, or elliptic space of finite dimension.


52. Σ is congruent to a two-dimensional elliptic space.













Proof. An S. L. space is homogeneous if a reflection exists in each geodesic
[2, p. 181]. Hence Σ is homogeneous by Theorems 17 and 51 and, being a
projective plane, must be congruent to a two-dimensional elliptic space.


VI. A FINAL REMARK ON Σ AND S. L. SPACES


Busemann states that if all translations exist along two geodesics of a closed
S. L. plane, the metric of the latter is elliptic [2, p. 219]. The proof of this,
which was left to the writer, can now be supplied. For brevity we merely
outline its main features. First, we can use results in [2] to show that Axioms 1 to
5 for Σ are valid propositions in any closed S. L. plane S. Thus S is a compact
metric space, two points at maximum distance have exactly two midpoints, etc.
Then, noting that Busemann's translations are defined somewhat differently
than are translations in Σ, we can show, nevertheless, that the assumption that
all of Busemann's translations exist along a geodesic of S implies that exactly
two translations as defined in Σ exist along it, sending an arbitrary one of its
points into an arbitrary one of its points. Then, whenever all Busemann's
translations exist along two geodesics of S, so do all translations exist along them
in the sense of Axiom 6. It follows that Axioms 1 to 6 are valid propositions in
S if all Busemann's translations exist along two geodesics of S. The metric of
S would then be elliptic by Theorem 52.


REFERENCES 

1. L. M. Blumenthal, Metric characterization of elliptic space, Trans. Amer. Math. Soc.,
vol. 59 (1946), 381-400.

2. Herbert Busemann, Metric methods in Finsler spaces and in the foundations of geometry (Ann.
Math. Studies, No. 8, Princeton, 1942).

3. W. Fenchel, Elementare Beweise und Anwendungen einiger Fixpunktsätze, Mat. Tids.,
B (1932), 66-87.

4. K. Menger, Untersuchungen über allgemeine Metrik, Math. Ann., vol. 100 (1928), 74-113.


New York University 


ON THE GEOMETRY OF LINEAL ELEMENTS ON 
A SPHERE, EUCLIDEAN KINEMATICS, AND 
ELLIPTIC GEOMETRY 


J. M. FELD 


1. Introduction. The geometry of slides and turns of oriented lineal elements
in the plane was first studied by Kasner [10]. Slides and turns generate whirls,
which constitute a three-parameter group W₃. The product of W₃ and M₃, the
three-parameter group of Euclidean displacements in the plane, yields a
six-parameter group of whirl-motions¹ G₆. The geometry of turbines², and also of
general series of lineal elements, under G₆ was investigated by Kasner in [10]
and, in subsequent papers, by Kasner and DeCicco, particularly in [3], [4], [11],
[12]. The author investigated the geometry of series of lineal elements under
the seven-parameter group of whirl-similitudes G₇ (of which G₆ is a subgroup)
in [6], [7], [8]. Among other things, the author showed that G₇ is isomorphic to
the group of collineations of the points in quasi-elliptic three-space, the geometry
of which had been previously studied by Blaschke [1], [2] and Grünwald [9]; he
also showed how the geometry of W₃, G₆, and G₇ can be interpreted kinematically
as the displacement of one plane over another.

In this paper we investigate the geometry of spherical whirls and
whirl-rotations of oriented lineal elements on a sphere. Some results in this field have
already been obtained by Strubecker [15], who mapped the points of elliptic
three-space E₃ one-to-one upon the oriented lineal elements of a unit sphere.
Using synthetic methods, Strubecker deduced, from the geometry of lines in E₃,
theorems on spherical turbines and families of curves on a sphere, analogous to
others found by Kasner for the plane [10]. We pursue the geometry of whirls
and whirl-rotations on a sphere in other directions and by means of other
methods. With the aid of quaternions we shall investigate the differential
geometry of series of lineal elements on a sphere subject to two groups, 𝔚₃
and 𝔊₆, analogous respectively to W₃ and G₆ in the plane, determining their
fundamental differential invariants and "Serret-Frenet formulae." Our
principal objective is to present a characterization of the geometry of whirls and
whirl-rotations on a sphere in terms of the kinematic geometry of continuous
displacements of one unit sphere over another, similar to the kinematic
interpretation we gave in [8] of whirls and whirl-motions in the plane in terms of
continuous displacements of one plane over another. The use of quaternions
has the advantage of making it particularly easy to map oriented lineal elements
on a sphere into the points of E₃. We indicate by means of this mapping how
the differential geometry under 𝔊₆ of series on a sphere can serve as a model for
the geometry of curves in E₃.

Received October 16, 1950; presented to the American Mathematical Society February
25, 1950.

¹Slides and turns of non-oriented lineal elements in the plane had been previously used by
Scheffers [14] in an investigation of certain groups of contact transformations. Whirls are
not contact transformations.
²A turbine is a series of oriented lineal elements the points of which lie on a circle (which
may be a point circle), and the (oriented) lines of which are tangent to a concentric oriented
circle.
Turbines in space were studied by A. Narasinga Rao [13] and Feld [5].


2. Whirl-rotations and turbines. Let the unit sphere S have its centre at
the origin O of a right-hand orthogonal coordinate frame f₀. If an oriented lineal
element e is tangent to S at the point P, we shall call the great circle through P
tangent to e and oriented like e the great cycle of e. Let the lineal element e₀ have
its point at (1,0,0) and let it be directed so that its great cycle passes through
(0,1,0) and is oriented in the counter-clockwise sense, when viewed from the
point (0,0,1). We shall call e₀ the primitive lineal element on S, and f₀ its
associated frame.

Let e₀, e₁, e₂, e₃ be the quaternion units such that

e₀eᵢ = eᵢe₀ = eᵢ, e₁² = e₂² = e₃² = e₁e₂e₃ = −1;

and let

x = x₀e₀ + x₁e₁ + x₂e₂ + x₃e₃, x̄ = x₀e₀ − x₁e₁ − x₂e₂ − x₃e₃.

Then a rotation of S around an oriented diameter is given by Hamilton's formula

N(x)u* = x̄ux, N(x) = xx̄,

where u and u* are unit vectors emanating from O. The components xᵢ of x are
the homogeneous Euler parameters of the rotation. If e is any lineal element
on S and x is the quaternion of the rotation e₀ → e, we shall call the components
of x the homogeneous coordinates of e. For convenience, when no confusion will
result, we shall let the quaternion x designate both the rotation e₀ → e and the
lineal element e. Evidently, if x designates e, so does kx, where k is a non-zero
scalar. If N(x) = 1, we shall call x a normalized quaternion and represent it
in bold type: 𝐱. To any quaternion x, N(x) ≠ 0, correspond two normalized
ones, namely ± x/[N(x)]^½. If the rotation x rotates e₀ → e around the unit
vector v through the angle 2θ, we can let

𝐱 = −cos θ + v sin θ

and

−𝐱 = −cos(π − θ) + v̄ sin(π − θ) (v̄ = −v).
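Hamilton's formula and the sign convention 𝐱 = −cos θ + v sin θ can be tried out numerically. The sketch below assumes the reading N(x)u* = x̄ux of the formula and represents a quaternion as a 4-tuple (scalar part first); the function names are illustrative and not taken from the paper.

```python
import math

def qmul(p, q):
    """Hamilton product of quaternions given as (w, x, y, z)."""
    pw, px, py, pz = p
    qw, qx, qy, qz = q
    return (pw * qw - px * qx - py * qy - pz * qz,
            pw * qx + px * qw + py * qz - pz * qy,
            pw * qy - px * qz + py * qw + pz * qx,
            pw * qz + px * qy - py * qx + pz * qw)

def conj(q):
    """Conjugate quaternion."""
    return (q[0], -q[1], -q[2], -q[3])

def rotation_quaternion(v, theta):
    """x = -cos(theta) + v sin(theta) for a unit vector v."""
    s = math.sin(theta)
    return (-math.cos(theta), v[0] * s, v[1] * s, v[2] * s)

def rotate(x, u):
    """Apply u* = conj(x) u x / N(x) to the vector u."""
    n = sum(c * c for c in x)
    w = qmul(qmul(conj(x), (0.0,) + tuple(u)), x)
    return tuple(c / n for c in w[1:])

# Rotation about (0,0,1) through the angle 2*theta = pi/2.
x = rotation_quaternion((0.0, 0.0, 1.0), math.pi / 4)
u_star = rotate(x, (1.0, 0.0, 0.0))
u_star_scaled = rotate(tuple(-3.0 * c for c in x), (1.0, 0.0, 0.0))
```

Under these assumptions the vector (1,0,0) is carried to (0,1,0), a counter-clockwise quarter turn seen from (0,0,1), and replacing x by kx (here k = −3) gives the same rotation, which is why the coordinates of a lineal element are homogeneous.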
The rotation x also rotates f₀ into another Cartesian frame f, situated relative
to e as f₀ is situated relative to e₀; we shall call f the frame associated with e.
A lineal element transformation x → x* given by the equation (in which we
suppress the factor of proportionality)

(2.1) x* = xa,

where

a = −cos α + u sin α (u² = −1),

represents a rotation of all the lineal elements on S around the unit vector u
through the angle 2α. We shall call such transformations lineal element rotations,
and we shall let the quaternion a represent the lineal element rotation (2.1).
The lineal element rotations constitute a three-parameter group 𝔐₃.

A lineal element transformation x → x* whereby every lineal element x is
rotated through the same angle 2β around a unit vector uₓ, situated relative to
the frame f associated with x as an arbitrarily given unit vector u₀ is situated
relative to f₀, shall be called a (spherical) whirl.

Let the rotation around u₀ through the angle 2β be denoted by the quaternion
b where

b = −cos β + u₀ sin β.

Then x* = xx̄bx, so that the whirl x → x* is given within a factor of
proportionality by the equation

(2.2) x* = bx.

The whirls constitute a three-parameter group of lineal element transformations
𝔚₃, skew-isomorphic to 𝔐₃.

The product of a whirl and a lineal element rotation is commutative. A
transformation which is the product of a whirl and a lineal element rotation
shall be called a whirl-rotation. Whirl-rotations $x \to x^*$ are given by the equation
(the factor of proportionality being suppressed)

(2.3) $x^* = bxa$.

The whirl-rotations constitute a six-parameter group $\mathfrak{G}_6$.
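The commutativity of whirls and lineal element rotations can be checked numerically. The following is a minimal sketch, assuming quaternions are stored as tuples (scalar, i, j, k); the helper names `qmul`, `turn`, `rotate`, and `whirl` are illustrative and do not come from the paper.

```python
import math

def qmul(p, q):
    """Hamilton product of quaternions stored as (scalar, i, j, k)."""
    a1, b1, c1, d1 = p
    a2, b2, c2, d2 = q
    return (a1*a2 - b1*b2 - c1*c2 - d1*d2,
            a1*b2 + b1*a2 + c1*d2 - d1*c2,
            a1*c2 - b1*d2 + c1*a2 + d1*b2,
            a1*d2 + b1*c2 - c1*b2 + d1*a2)

def turn(axis, half_angle):
    """Quaternion -cos(t) + u*sin(t) for the unit vector u = axis."""
    c, s = math.cos(half_angle), math.sin(half_angle)
    return (-c, s*axis[0], s*axis[1], s*axis[2])

def rotate(x, a):
    """Lineal element rotation (2.1): x* = x a."""
    return qmul(x, a)

def whirl(x, b):
    """Whirl (2.2): x* = b x."""
    return qmul(b, x)

# Hypothetical sample data: one lineal element, one rotation, one whirl.
x = turn((0.0, 0.0, 1.0), 0.3)
a = turn((1.0, 0.0, 0.0), 0.7)
b = turn((0.0, 1.0, 0.0), 1.1)

rot_then_whirl = whirl(rotate(x, a), b)
whirl_then_rot = rotate(whirl(x, b), a)

# Largest componentwise difference between the two orders of application.
gap = max(abs(p - q) for p, q in zip(rot_then_whirl, whirl_then_rot))
# Norm of the transformed quaternion; normalized inputs stay normalized.
norm = sum(c*c for c in rot_then_whirl)
```

Both orders give b x a, which is simply the associativity of quaternion multiplication; this is why (2.3) needs no convention about which factor acts first.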


Let the symbol (x, y) represent the scalar product of two lineal elements x and
y, defined as follows:

$(x, y) = \tfrac{1}{2}(x\bar{y} + y\bar{x})$.

Since $x\bar{y} + y\bar{x} = \bar{x}y + \bar{y}x$, we also have

$(x, y) = \tfrac{1}{2}(\bar{x}y + \bar{y}x)$.

With the aid of this definition we obtain the following useful equalities:

(2.4)
$(x, y) = (y, x)$,   $(x, x) = N(x) = x\bar{x}$,
$(\alpha x, y) = (x, \alpha y) = \alpha(x, y)$   ($\alpha$ a scalar),
$(x, y + z) = (x, y) + (x, z)$,
$(bxa, bya) = (a, a)(b, b)(x, y)$.

Since $N(\bar{x}y) = 1$ when x and y are normalized, we can let $\bar{x}y$ equal either
$-\cos\delta + v\sin\delta$ or $-\cos(\pi - \delta) + \bar{v}\sin(\pi - \delta)$, $v^2 = -1$. Therefore
$(x, y) = -\cos\delta$ in the former case and $-\cos(\pi - \delta)$ in the latter. Thus
$\cos\delta = \pm(x, y)$. We shall call $\delta$ and $\pi - \delta$ ($0 \le \delta \le \pi$) the distances
between x and y: x and y coincide only when $\delta = 0$ or $\pi$.











Evidently

(2.5) $\cos^2\delta = \dfrac{(x, y)^2}{(x, x)(y, y)}$.





If (x, y) = 0, in which case $\delta = \tfrac{1}{2}\pi$, we shall say that x and y are orthogonal.
If we subject x and y to the whirl-rotation (2.3) we obtain, by virtue of the
last equation in (2.4),

$(x^*, y^*) = (a, a)(b, b)(x, y)$.

This yields

THEOREM 2.1. Under the group of whirl-rotations a pair of elements x and y
have an invariant $\cos^2\delta$, given by (2.5), $\delta$ and $\pi - \delta$ being the distances between
x and y.
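Theorem 2.1 lends itself to a quick numerical check. The sketch below assumes the same tuple representation (scalar, i, j, k); the function names and the sample quaternions are illustrative choices, and none of the quaternions is normalized, so the factor of proportionality is exercised as well.

```python
def qmul(p, q):
    """Hamilton product of quaternions stored as (scalar, i, j, k)."""
    a1, b1, c1, d1 = p
    a2, b2, c2, d2 = q
    return (a1*a2 - b1*b2 - c1*c2 - d1*d2,
            a1*b2 + b1*a2 + c1*d2 - d1*c2,
            a1*c2 - b1*d2 + c1*a2 + d1*b2,
            a1*d2 + b1*c2 - c1*b2 + d1*a2)

def conj(q):
    """Quaternion conjugate."""
    return (q[0], -q[1], -q[2], -q[3])

def scalar(x, y):
    """(x, y) = (x*conj(y) + y*conj(x)) / 2; the vector parts cancel."""
    s = qmul(x, conj(y))
    t = qmul(y, conj(x))
    return 0.5 * (s[0] + t[0])

def cos2_delta(x, y):
    """Invariant (2.5): (x, y)^2 / ((x, x)(y, y))."""
    return scalar(x, y)**2 / (scalar(x, x) * scalar(y, y))

# Hypothetical, deliberately unnormalized sample quaternions.
x = (1.0, 2.0, 3.0, 4.0)
y = (0.5, -1.0, 2.0, 0.25)
a = (2.0, 0.0, 1.0, -1.0)
b = (1.0, 1.0, 0.0, 3.0)

# Whirl-rotation (2.3) applied to both elements.
x_star = qmul(qmul(b, x), a)
y_star = qmul(qmul(b, y), a)

before = cos2_delta(x, y)
after = cos2_delta(x_star, y_star)
```

The factors (a, a)(b, b) appear squared in the numerator and once in each factor of the denominator, so they cancel; `before` and `after` agree up to rounding.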

Let the lineal elements x and y be distinct, that is, $\cos^2\delta \ne 1$. The $\infty^1$ lineal
elements z defined by the equation

(2.6) $z = \alpha x + \beta y$   ($\alpha$, $\beta$ real scalars, $\alpha^2 + \beta^2 \ne 0$),

shall be called a linear series of lineal elements. From (2.6) we obtain with the
aid of (2.4)

$(x, z) = \alpha(x, x) + \beta(x, y)$,
$(y, z) = \alpha(x, y) + \beta(y, y)$,
$(z, z) = \alpha(x, z) + \beta(y, z)$.

Eliminating $\alpha$ and $\beta$, we obtain

(2.7) $D = \begin{vmatrix} (x, x) & (x, y) & (x, z) \\ (y, x) & (y, y) & (y, z) \\ (z, x) & (z, y) & (z, z) \end{vmatrix} = 0$.


The following theorems can now be easily established. 


THEOREM 2.2. Two distinct lineal elements determine a linear series. 


THEOREM 2.3. A necessary and sufficient condition that three lineal elements 
x,y and z lie on a linear series is that D = 0. 
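The criterion of Theorem 2.3 is a Gram determinant test. A minimal sketch follows, assuming that the scalar product (x, y) is evaluated as the sum of products of the four components (which is the scalar part of x times the conjugate of y); `dot4`, `det3`, and `gram_det` are illustrative names.

```python
def dot4(x, y):
    """Scalar product (x, y) computed componentwise."""
    return sum(p * q for p, q in zip(x, y))

def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a*(e*i - f*h) - b*(d*i - f*g) + c*(d*h - e*g)

def gram_det(x, y, z):
    """Determinant D of (2.7) for three lineal elements."""
    v = (x, y, z)
    return det3([[dot4(p, q) for q in v] for p in v])

# Hypothetical sample elements.
x = (1.0, 2.0, 0.0, 1.0)
y = (0.0, 1.0, 1.0, 2.0)
z_on = tuple(2.0*p - 3.0*q for p, q in zip(x, y))   # lies on the series of x, y
z_off = (3.0, 0.0, 1.0, 1.0)                        # linearly independent of x, y

d_on = gram_det(x, y, z_on)
d_off = gram_det(x, y, z_off)
```

For `z_on` the determinant vanishes; for `z_off` it is strictly positive, as a Gram determinant of linearly independent vectors must be.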


Let q, N(q) = 1, be a given lineal element, and let $a = -\cos\theta + r\sin\theta$,
where r is a constant unit vector and $\theta$ is variable. The lineal element

(2.8) $x = qa$,   N(q) = 1,

is obtained by rotating q around the vector r through the angle $2\theta$. It will be
convenient hereafter to let a unit vector v designate also the point on S that has
for its Cartesian coordinates the components of v. As $\theta$ varies from 0 to $\pi$, x
describes a series of lineal elements, the points of which lie on a circle c (which
may be a point circle) having its centre at r, and the great circles of which make
the same angle with c. Such a series shall be called a spherical turbine; c shall
be called the circle of the turbine, and the points r and −r the centres of the
turbine. If we select three lineal elements (2.8) by assigning three arbitrary
values to $\theta$, we find that their quaternions satisfy (2.7); consequently, spherical
turbines are linear series.

Let us define

$l = q r \bar{q}$.

Evidently l is a constant unit vector. Since

$x r \bar{x} = (qa)\, r\, \overline{(qa)} = q(a r \bar{a})\bar{q} = q r \bar{q} = l$,

the turbine T defined parametrically by means of (2.8) has the non-parametric
equation

(2.9) $\bar{x} l x = r$   ($x\bar{x} = 1$).

But the equation of T can also take the form $\bar{x}(-l)x = -r$; therefore, the
lineal elements of T are represented by those quaternions x which correspond
to the rotations of the unit sphere S that carry point l to r, and point −l to
−r; that is to say, the quaternions x correspond to the rotations of S that
carry the oriented diameter $(-l \to l)$ into $(-r \to r)$.
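The passage from the parametric form (2.8) to the non-parametric equation of the turbine can be verified numerically. The sketch below assumes the tuple representation (scalar, i, j, k), with a unit vector stored as a quaternion of zero scalar part; the element q and the vector r are arbitrary sample choices.

```python
import math

def qmul(p, q):
    """Hamilton product of quaternions stored as (scalar, i, j, k)."""
    a1, b1, c1, d1 = p
    a2, b2, c2, d2 = q
    return (a1*a2 - b1*b2 - c1*c2 - d1*d2,
            a1*b2 + b1*a2 + c1*d2 - d1*c2,
            a1*c2 - b1*d2 + c1*a2 + d1*b2,
            a1*d2 + b1*c2 - c1*b2 + d1*a2)

def conj(q):
    """Quaternion conjugate."""
    return (q[0], -q[1], -q[2], -q[3])

# Hypothetical data: a normalized lineal element q and a unit vector r.
q = (-math.cos(0.4), math.sin(0.4), 0.0, 0.0)
r = (0.0, 0.6, 0.0, 0.8)

# Left coordinate l = q r conj(q) of the turbine.
l = qmul(qmul(q, r), conj(q))

worst = 0.0
for k in range(8):
    theta = math.pi * k / 8
    # a = -cos(theta) + r sin(theta), the variable factor in (2.8).
    a = (-math.cos(theta), *(math.sin(theta) * c for c in r[1:]))
    x = qmul(q, a)
    # conj(x) l x should reproduce r for every theta.
    image = qmul(qmul(conj(x), l), x)
    worst = max(worst, max(abs(u - v) for u, v in zip(image, r)))
```

The check works because a commutes with r, so the dependence on theta drops out of the product; `worst` records the largest deviation over the sampled values of theta.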


3. The kinematic representation of turbines. Let us consider two concentric
unit spheres $S_l$ (the left sphere) and $S_r$ (the right sphere). Let the pair of
diametrically opposite points l and −l lie on $S_l$, and let the pair of points r and
−r lie on $S_r$. We can now map the turbine T upon two ordered pairs of points
on $S_l$ and $S_r$, namely, l, r and the diametrically opposite pair −l, −r. We
shall call l, r (or, alternatively, −l, −r) respectively the left and right coordinates
of T, and let either of the symbols [l, r] or [−l, −r] represent T. Let this
mapping whereby every turbine T on S corresponds to two pairs of image points
on $S_l$ and $S_r$ be called the kinematic representation $\mathcal{K}$. We can make $\mathcal{K}$
one-to-one by orienting the turbines on S. With every turbine T we associate two
oriented turbines $T^+$ and $T^-$ by assigning to $T^+$ the centre r and to $T^-$ the centre
−r. A one-to-one kinematic representation of oriented turbines is brought about
by choosing the pair of points l, r as the image and [l, r] as the symbol of $T^+$, and
the pair −l, −r as the image and [−l, −r] as the symbol of $T^-$. The simultaneous
reflection of the points on $S_l$ and $S_r$ in their common centre corresponds
to a reversal of the orientation of the turbines on S.

Let T: [l, r] be the turbine determined by the two lineal elements x and y.
The parametric equation (2.8) of T yields

$y = x(-\cos\theta + r\sin\theta)$.

Since

$\bar{x}y + \bar{y}x = -2\cos\theta$,

$\theta$ is a distance between x and y; moreover, since

$\bar{x}y - \bar{y}x = 2r\sin\theta = 2r\sin\delta$,

where $\delta$ is either distance between x and y, we obtain

THEOREM 3.1. The turbine determined by the lineal elements x, y ($x \ne \pm y$), has
turbine coordinates l, r given by the formulae

$l = \dfrac{(y\bar{x} - x\bar{y})\csc\delta}{2\{(x, x)(y, y)\}^{1/2}}$,   $r = \dfrac{(\bar{x}y - \bar{y}x)\csc\delta}{2\{(x, x)(y, y)\}^{1/2}}$,

where $\delta$ is a distance between x and y.
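The formulae of Theorem 3.1 can be tried on a turbine whose coordinates are known in advance. This is a sketch under stated assumptions: quaternions are tuples (scalar, i, j, k), the distance is taken as the arc cosine of −(x, y) divided by the square root of (x, x)(y, y) (the other admissible distance merely changes the sign of both l and r), and all names and sample values are illustrative.

```python
import math

def qmul(p, q):
    """Hamilton product of quaternions stored as (scalar, i, j, k)."""
    a1, b1, c1, d1 = p
    a2, b2, c2, d2 = q
    return (a1*a2 - b1*b2 - c1*c2 - d1*d2,
            a1*b2 + b1*a2 + c1*d2 - d1*c2,
            a1*c2 - b1*d2 + c1*a2 + d1*b2,
            a1*d2 + b1*c2 - c1*b2 + d1*a2)

def conj(q):
    return (q[0], -q[1], -q[2], -q[3])

def dot4(x, y):
    """Scalar product (x, y) computed componentwise."""
    return sum(p * q for p, q in zip(x, y))

def turbine_coordinates(x, y):
    """Left and right coordinates (l, r) of the turbine through x and y."""
    root = math.sqrt(dot4(x, x) * dot4(y, y))
    cos_delta = max(-1.0, min(1.0, -dot4(x, y) / root))
    delta = math.acos(cos_delta)
    k = 1.0 / (2.0 * root * math.sin(delta))
    left = [u - v for u, v in zip(qmul(y, conj(x)), qmul(x, conj(y)))]
    right = [u - v for u, v in zip(qmul(conj(x), y), qmul(conj(y), x))]
    return tuple(k * c for c in left), tuple(k * c for c in right)

# Hypothetical turbine with known right coordinate r through the element q.
q = (-math.cos(0.4), math.sin(0.4), 0.0, 0.0)
r = (0.0, 0.6, 0.0, 0.8)
theta = 0.7
a = (-math.cos(theta), *(math.sin(theta) * c for c in r[1:]))

x = tuple(2.0 * c for c in q)             # homogeneous multiple of q
y = tuple(3.0 * c for c in qmul(q, a))    # homogeneous multiple of q a

l_found, r_found = turbine_coordinates(x, y)
l_expected = qmul(qmul(q, r), conj(q))
```

Scaling x and y by different non-zero factors leaves the result unchanged, in keeping with the homogeneous character of the coordinates.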


Evidently all turbines have the same "length" $\pi$.
Because equation (2.9) can be regarded as a necessary and sufficient condition
for the incidence of a turbine [l, r] and a lineal element x, we obtain

THEOREM 3.2. To the $\infty^2$ oriented turbines T incident to a given lineal element x
on S correspond, by virtue of the one-to-one mapping $\mathcal{K}$, $\infty^2$ left image points l on
$S_l$ and $\infty^2$ right image points r on $S_r$, so that the rotation of $S_l$ that corresponds to
the quaternion x brings the $\infty^2$ left image points into coincidence with their $\infty^2$
associated right image points.

The whirl-rotation (2.3) transforms [l, r] into [l*, r*] where $l^* = b l \bar{b}$,
$r^* = \bar{a} r a$.

The set of $\infty^2$ lineal elements orthogonal to a given lineal element u shall be
called a planar field. The elements x of this planar field $\mathcal{F}$ are given by the
parametric equation

(3.1) $x = ua$,   $a + \bar{a} = 0$,   $a\bar{a} = 1$.

Eliminating a we obtain the non-parametric equation of $\mathcal{F}$:

(3.2) $\bar{u}x + \bar{x}u = 0$.

The four components of u determine $\mathcal{F}$ and shall be called the homogeneous
coordinates of $\mathcal{F}$.

If the lineal elements y and z lie in the field u, $x = \alpha y + \beta z$ ($\alpha$ and $\beta$ real
scalars) satisfies (3.2) identically. Therefore the turbine determined by y and z
lies in u.


By means of the whirl-rotation $x \to bxa$ the planar field u is transformed into
the planar field u* where

(3.3) $u^* = bua$.

Hence we obtain

THEOREM 3.3. Whirl-rotations transform planar fields into planar fields.

We can regard (3.3) as the equation of $\mathfrak{G}_6$ in planar field coordinates.

If the lineal elements y and z determine the turbine [l, r], then, in order that
[l, r] lie in the field u, it is necessary and sufficient that y and z satisfy (3.2).
Hence

$\bar{u}y + \bar{y}u = 0$,   $\bar{u}z + \bar{z}u = 0$,

and therefore, u being normalized,

$\bar{z}y = \bar{u}z\bar{y}u$,   $\bar{y}z = \bar{u}y\bar{z}u$;

consequently

$\bar{y}z - \bar{z}y = \bar{u}(y\bar{z} - z\bar{y})u$.

Using the formulae for l and r given in Theorem 3.1 we obtain

(3.4) $\bar{u} l u = -r$

as a necessary and sufficient condition that the turbine [l, r] lie in the field u.


We now have 


THEOREM 3.4. By means of $\mathcal{K}$ the $\infty^2$ oriented turbines that lie in a planar
field u are mapped upon pairs of points l, r on $S_l$ and $S_r$ respectively, so that a
symmetry (that is, an improper orthogonal transformation) will transform the left
image points into their corresponding right image points; the homogeneous Euler
parameters of this symmetry are the coordinates of the planar field u.

The companion Theorems 3.2 and 3.4 justify calling $\mathcal{K}$ a kinematic
representation.

Inasmuch as planar fields are transformed like lineal elements by whirl-
rotations, we define the angles $\phi$ and $\pi - \phi$, $0 \le \phi \le \pi$, between the two planar
fields u and v by an expression dual to that used for the distances between two
lineal elements, namely,

(3.5) $\cos^2\phi = \dfrac{(u, v)^2}{(u, u)(v, v)}$.

Let the lineal element u be called the pole of the planar field u. The following 

theorems are now easily established. 


THEOREM 3.5. The angle between two planar fields is equal to the distance 
between their poles. 


THEOREM 3.6. Two planar fields u and v intersect in a turbine [l, r] where

$l = \dfrac{(u\bar{v} - v\bar{u})\csc\phi}{2\{(u, u)(v, v)\}^{1/2}}$,   $r = \dfrac{(\bar{u}v - \bar{v}u)\csc\phi}{2\{(u, u)(v, v)\}^{1/2}}$,

and $\phi$ is either one of the angles between u and v.


If x, y and z are three linearly independent lineal elements, there exists a
unique lineal element u orthogonal to all of them. Since u must satisfy the
equations

$\bar{u}x + \bar{x}u = \bar{u}y + \bar{y}u = \bar{u}z + \bar{z}u = 0$,

the components of u are given by

$u_0 : u_1 : u_2 : u_3 = |x_1 y_2 z_3| : -|x_0 y_2 z_3| : |x_0 y_1 z_3| : -|x_0 y_1 z_2|$.
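The ratios for the components of u are the signed 3×3 minors of the array whose rows are the components of x, y and z. A minimal sketch follows, assuming that the symbol for each determinant abbreviates the minor obtained by deleting the missing column, and that (x, y) is computed componentwise; `pole`, `det3`, and `dot4` are illustrative names.

```python
def dot4(x, y):
    """Scalar product (x, y) computed componentwise."""
    return sum(p * q for p, q in zip(x, y))

def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a*(e*i - f*h) - b*(d*i - f*g) + c*(d*h - e*g)

def pole(x, y, z):
    """Lineal element orthogonal to x, y, z, from signed 3x3 minors."""
    rows = (x, y, z)
    u = []
    for skip in range(4):
        # Delete column `skip`, then attach the alternating sign.
        minor = [[row[j] for j in range(4) if j != skip] for row in rows]
        u.append((-1) ** skip * det3(minor))
    return tuple(u)

# Hypothetical linearly independent sample elements (integer components).
x = (1, 2, 0, 1)
y = (0, 1, 1, 2)
z = (3, 0, 1, 1)

u = pole(x, y, z)
products = (dot4(u, x), dot4(u, y), dot4(u, z))
```

Each scalar product is the expansion of a 4×4 determinant with a repeated row, hence zero; with integer input the check is exact.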


This yields 











THEOREM 3.7. A planar field is determined by three linearly independent lineal
elements.

THEOREM 3.8. If x, y, and z are linearly independent lineal elements, the $\infty^2$
lineal elements

$w = \alpha x + \beta y + \gamma z$   ($\alpha$, $\beta$, $\gamma$ real numbers)

constitute the planar field determined by x, y, and z.


4. Differential invariants of series of lineal elements under $\mathfrak{W}_3$ and $\mathfrak{R}_3$.
A series of lineal elements on S is a one-dimensional extent of lineal elements
defined by

(4.1) $x = x(t)$

where t is a real parameter. We assume that $dx/dt \ne 0$ in the interval $t_1 \le t \le t_2$
and that x(t) has a continuous second derivative. We can, without loss of
generality, also assume that x(t) is normalized, that is, that

(4.2) $x(t)\bar{x}(t) = 1$.

In addition we assume, as we may, that the quaternions a and b that appear in
the lineal element rotation $x \to xa$ and whirl $x \to bx$ are normalized, so that
normalized series are transformed by these transformations into normalized
series.

Let the whirl b transform the series $\mathfrak{S}$: (4.1) into the series $\mathfrak{S}^*$: $x^*(t)$. Then

$d\sigma^2 = d\bar{x}\,dx = d\bar{x}^*\,dx^*$

is invariant. We shall call

$\sigma = \displaystyle\int_{t_0}^{t} \left(\frac{dx}{dt}\,\frac{d\bar{x}}{dt}\right)^{1/2} dt$

the $\mathfrak{W}_3$-arc length of $\mathfrak{S}$ measured from $t_0$ to t. Let the equation of $\mathfrak{S}$ be expressed
in terms of the invariant parameter $\sigma$. Then, letting $x' = dx/d\sigma$, we have

(4.3) $\bar{x}(\sigma)x(\sigma) = 1$,   $\bar{x}'x' = 1$.

Evidently

(4.4) $Z = \bar{x}x'$

is a differential invariant under $\mathfrak{W}_3$. Equations (4.3) yield

(4.5) $\bar{x}x' + \bar{x}'x = 0$,   $N(x) = N(x') = 1$.

Therefore $Z(\sigma)$ is a unit vector. The three components of Z, namely
$Z_i$ (i = 1, 2, 3) where $\sum Z_i^2 = 1$, are differential scalar invariants of $\mathfrak{S}$ under $\mathfrak{W}_3$.
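We proceed to find geometric interpretations for $\sigma$ and Z. Let us consider
the distance $\Delta\delta$ between the lineal elements $x(\sigma)$ and $x(\sigma + \Delta\sigma)$ on $\mathfrak{S}$. Since

$2\cos\Delta\delta = \bar{x}(\sigma)x(\sigma + \Delta\sigma) + \bar{x}(\sigma + \Delta\sigma)x(\sigma)$

(see §3), we obtain, on expanding both sides to the second order in $\Delta\sigma$,

$\left(\dfrac{d\delta}{d\sigma}\right)^2 = -\tfrac{1}{2}(\bar{x}x'' + \bar{x}''x)$.

But differentiation of the first equation in (4.5) yields

(4.6) $\bar{x}x'' + \bar{x}''x = -2$.

Hence

$d\sigma = \pm\, d\delta$.

Next, let us consider the turbine T: [l, r] with centres at r and −r, tangent to
$\mathfrak{S}$ at $\sigma_0$. By Theorem 3.1

$r = \lim_{\Delta\sigma \to 0} \tfrac{1}{2}[\bar{x}(\sigma_0)x(\sigma_0 + \Delta\sigma) - \bar{x}(\sigma_0 + \Delta\sigma)x(\sigma_0)]\csc\Delta\sigma$.

Since $\bar{x}x'$ is a unit vector, we obtain

$r = \tfrac{1}{2}[\bar{x}(\sigma_0)x'(\sigma_0) - \bar{x}'(\sigma_0)x(\sigma_0)] = \bar{x}(\sigma_0)x'(\sigma_0) = Z(\sigma_0)$.

Therefore $Z(\sigma)$ and $-Z(\sigma)$ are the loci of the centres of the turbines tangent
to the series $\mathfrak{S}$.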

Under the group $\mathfrak{R}_3$ we obtain the same invariant parameter $\sigma$ as under $\mathfrak{W}_3$,
and we assume again that $\mathfrak{S}$ is expressed in terms of this parameter. It is evident
that the unit vector

(4.7) $W = x'\bar{x}$

is invariant under $\mathfrak{R}_3$. The turbine tangent to $\mathfrak{S}$ at $\sigma_0$ has a left image vector l
which, by Theorem 3.1, is given by

$l = \lim_{\Delta\sigma \to 0} \tfrac{1}{2}[x(\sigma_0 + \Delta\sigma)\bar{x}(\sigma_0) - x(\sigma_0)\bar{x}(\sigma_0 + \Delta\sigma)]\csc\Delta\sigma = x'(\sigma_0)\bar{x}(\sigma_0) = W(\sigma_0)$.

Consequently we obtain a geometric interpretation of the differential invariant
$W(\sigma)$ of $\mathfrak{S}$ under $\mathfrak{R}_3$, namely, the locus of the left image point l of the turbines
tangent to $\mathfrak{S}$.

Differential invariants of $\mathfrak{S}$ of higher order relative to $\mathfrak{W}_3$ [$\mathfrak{R}_3$] result from
differentiating $Z(\sigma)$ [$W(\sigma)$] with respect to $\sigma$.

To find kinematic interpretations for $Z(\sigma)$ and $W(\sigma)$, we proceed as follows:

Let $S_f$ and $S_m$ be two unit spheres concentric at O; $S_f$ is fixed in position but
$S_m$ is mobile around O. Let $e_f$ be a primitive lineal element on $S_f$ and let $F_f$ be
its associated rectangular Cartesian frame. Let $e_m$ be an arbitrary (primitive)
lineal element on $S_m$ and $F_m$ its associated frame; $e_m$ and its frame $F_m$ are mobile
with $S_m$ relative to $S_f$ and $e_f$. Lineal elements and points on $S_f$ will be referred
to $e_f$ and $F_f$, but lineal elements and points on $S_m$ will be referred both to $e_f$ and
$e_m$ or, what is equivalent, to their associated frames. Let the initial position
of $e_m$, and therefore also of $S_m$, relative to $e_f$ be given by the quaternion $x_0$,
namely, the quaternion of the rotation $e_f \to e_m$. As $S_m$ undergoes a continuous
displacement $\mathcal{D}$ around O, $e_m$ traces on $S_f$ a series $\mathfrak{S}$ which, referred to $e_f$, has
the equation $x = x(\sigma)$, where $x_0 = x(\sigma_0)$ denotes the initial position of $e_m$. $\mathfrak{S}$
defines completely the displacement $\mathcal{D}$. But $\mathcal{D}$ can be defined as well by a
series $\mathfrak{S}^*$ traced on $S_f$ by any other lineal element $e^*_m$ on $S_m$. Let the quaternion
that determines the position of $e^*_m$ relative to $e_f$ be $x^*_0$; then, if $x^*_0 = bx_0$ is the
whirl $e_m \to e^*_m$, this whirl also transforms $\mathfrak{S} \to \mathfrak{S}^*$.

Let P be a point on $S_m$ and let its coordinates, when referred to $e_m$, be the
components of the unit vector v. Then, referred to $e_f$ on $S_f$, P has for its
coordinates the components of the unit vector

(4.8) $V = \bar{x}vx$,

because the rotation that transports $e_f \to e_m$ transforms $v \to V$. During the
motion $x(\sigma)$ of $S_m$ the vector V describes a cone, the intersection of which with
$S_f$ is the trajectory that the point P of $S_m$ traces on $S_f$. To find the poles
(instantaneous centres of rotation) of the motion, we seek those points V on $S_f$ for which
$V'(\sigma) = 0$. From (4.8) we get $xV' + x'V = vx'$. Consequently $V = \bar{x}'vx'$, and
therefore $x'V\bar{x}' = v = xV\bar{x}$. Hence

$V = \bar{x}x'V\bar{x}'x$,

which implies that V is collinear with the vector $\bar{x}x'$. Therefore the locus of the
pole on the fixed sphere (the fixed or space centrode) is $\pm Z(\sigma)$.

The locus of the pole on the mobile sphere (the mobile or body centrode),
referred to $e_m$, is

(4.9) $v = xV\bar{x} = x(\pm Z)\bar{x} = \pm x\bar{x}x'\bar{x} = \pm x'\bar{x} = \pm W(\sigma)$.


During the displacement $\mathcal{D}$ defined by the series $\mathfrak{S}$, the curve $W(\sigma)$ on $S_m$
rolls without slipping on the curve $Z(\sigma)$ on $S_f$, while, of course, $-W(\sigma)$,
diametrically opposite to $W(\sigma)$ on $S_m$, rolls on $-Z(\sigma)$. The motion $\mathcal{D}$ is
completely determined by the centrodes $Z(\sigma)$ and $W(\sigma)$, which, in turn, are
determined by $\mathfrak{S}$. However, it should be observed that the equation of the
mobile centrode is referred not to $e_f$ but to any lineal element of $\mathfrak{S}$, say to
$e_m$: $x(\sigma_0)$. Therefore, if we replaced on $S_m$ the primitive element $e_m$ by another
primitive element $e^*_m$: $x^*$, the motion $\mathcal{D}$ that was defined by the series $\mathfrak{S}$ traced
out on $S_f$ by $e_m$ would instead be defined by a series $\mathfrak{S}^*$ traced out on $S_f$ by $e^*_m$.
But now the mobile centrode would be referred to $e^*_m$ and, according to (4.9),
would be given by $W = \pm x^{*\prime}\bar{x}^*$. Consequently the motion $\mathcal{D}$ is determined
by the fixed centrode $\pm Z(\sigma)$ and an arbitrary primitive lineal element. If w
is the whirl $e_m \to e^*_m$, w transforms the series $\mathfrak{S}$ generated by $e_m$ into the series
$\mathfrak{S}^*$ generated by $e^*_m$. Since $\mathfrak{S}$ and $\mathfrak{S}^*$ define the same motion $\mathcal{D}$, and $\mathcal{D}$ is
defined by $\pm Z(\sigma)$ and an arbitrary lineal element on $S_m$, $Z(\sigma)$ determines a
series within a whirl. We can therefore regard $Z = Z(\sigma)$ as the intrinsic equation
of a series relative to $\mathfrak{W}_3$.

The motion defined by a turbine [l, r] is a continuous rotation of $S_m$ around
the diameter $(-l \to l)$, and therefore has for its fixed [mobile] centrode the pair
of diametrically opposite points $\pm r$ [$\pm l$]. If $\mathcal{D}$ is a displacement defined by
a series $\mathfrak{S}$ other than a turbine, the fixed [mobile] centrode of $\mathcal{D}$ is the locus
of the right [left] image points $\pm r(\sigma)$ [$\pm l(\sigma)$] of the turbines tangent to $\mathfrak{S}$.


5. Differential invariants of series under $\mathfrak{G}_6$. Let the series $\mathfrak{S}$ have the
equation

(5.1) $x = x(t)$,   $x(t)\bar{x}(t) = 1$,

where x(t) has a continuous third derivative in the interval $t_1 \le t \le t_2$ in which
$dx/dt \ne 0$. Subjecting $\mathfrak{S}$ to the whirl-rotation

$x^* = bxa$,   $N(a) = N(b) = 1$,

we obtain

$dx^*\,d\bar{x}^* = b\,dx\,a\,\bar{a}\,d\bar{x}\,\bar{b} = dx\,d\bar{x}$.

We let the invariant $dx\,d\bar{x} = d\sigma^2$ as before, but now we designate

$\sigma = \displaystyle\int_{t_0}^{t} \left(\frac{dx}{dt}\,\frac{d\bar{x}}{dt}\right)^{1/2} dt$

as the $\mathfrak{G}_6$-arc length of $\mathfrak{S}$ measured from $t_0$ to t. Let the parameter t in (5.1)
be expressed in terms of $\sigma$; then the equation of $\mathfrak{S}$ becomes $x = x(\sigma)$ where

(5.2) $x(\sigma)\bar{x}(\sigma) = 1$,   $x'(\sigma)\bar{x}'(\sigma) = 1$.

We shall consider only series $x(\sigma)$ for which

$(x'', x'') - 1 \ne 0$

in the interval $\sigma_1 \le \sigma \le \sigma_2$. The significance of this restriction will be
explained later.

Let us associate with any lineal element x of $\mathfrak{S}$ in the interval $(\sigma_1, \sigma_2)$ a frame
composed of four mutually orthogonal lineal elements represented by the
normalized quaternions $\xi_i$ (i = 1, 2, 3, 4) in the following manner:

According to (5.2), x and x' are normalized quaternions. Moreover,

(5.3) $x\bar{x}' + x'\bar{x} = 2(x, x') = 0$.

Therefore x and x' are orthogonal lineal elements. Let

(5.4) $\xi_1 = x$,   $\xi_2 = x'$.

The second equations in (5.2) and (5.3) yield, on differentiation,

(5.5) $(x', x'') = 0$,   $(x, x'') = -1$.

Let $y = x + \alpha x''$ where $\alpha$ is a scalar. We seek a value for $\alpha$ for which (x, y) = 0.
Since

$(x, y) = (x, x) + \alpha(x, x'') = 1 - \alpha$,

we obtain $\alpha = 1$. Therefore $y = x + x''$, and y is consequently orthogonal to
x and to x'. Now

$N(y) = (x + x'', x + x'') = (x, x) + 2(x, x'') + (x'', x'') = (x'', x'') - 1 \ne 0$.

Let

(5.6) $\xi_3 = \dfrac{x + x''}{[(x'', x'') - 1]^{1/2}}$.

Thus $\xi_3$ is a normalized quaternion that represents a lineal element orthogonal
to $\xi_1$ and to $\xi_2$.

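Let z be the pole of the planar field determined by the linearly independent
lineal elements x, x' and y. Then

$(x, z) = (x', z) = (y, z) = 0$.

Since $(y, z) = (x, z) + (x'', z)$, we get $(x'', z) = 0$. Therefore, $e_1$, $e_2$, $e_3$ denoting
the quaternion units,

(5.7) $z = \begin{vmatrix} 1 & e_1 & e_2 & e_3 \\ x_0 & x_1 & x_2 & x_3 \\ x_0' & x_1' & x_2' & x_3' \\ x_0'' & x_1'' & x_2'' & x_3'' \end{vmatrix}$

and

$N(z) = \begin{vmatrix} (x, x) & (x, x') & (x, x'') \\ (x', x) & (x', x') & (x', x'') \\ (x'', x) & (x'', x') & (x'', x'') \end{vmatrix} = \begin{vmatrix} 1 & 0 & -1 \\ 0 & 1 & 0 \\ -1 & 0 & (x'', x'') \end{vmatrix} = (x'', x'') - 1$.

Let

(5.8) $\xi_4 = \dfrac{z}{[(x'', x'') - 1]^{1/2}}$.

Thus $\xi_4$ is a normalized quaternion representing a lineal element orthogonal to
the lineal elements $\xi_1$, $\xi_2$, and $\xi_3$.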
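Let d represent the determinant of the components of the four $\xi_i$. Then

$d^2 = |(\xi_i, \xi_j)| = 1$.

Consequently the four normalized quaternions $\xi_i$ are linearly independent. They
therefore constitute a linear basis for arbitrary quaternions. Hence we can set
the four quaternions $\xi_i'$ ($= d\xi_i/d\sigma$) equal to the linear combinations

(5.9) $\xi_i' = a_{i1}\xi_1 + a_{i2}\xi_2 + a_{i3}\xi_3 + a_{i4}\xi_4$   (i = 1, 2, 3, 4).

Since $(\xi_i, \xi_j) = \delta_{ij}$, we obtain

(5.10) $(\xi_i', \xi_j) + (\xi_i, \xi_j') = 0$.

Scalar multiplication of the equations (5.9) by the $\xi_j$ yields

$a_{ij} = (\xi_i', \xi_j) = -(\xi_i, \xi_j') = -a_{ji}$.

Therefore the matrix $\|a_{ij}\|$ is skew-symmetric. Furthermore, since $\xi_1' = \xi_2$, we
find that

$a_{12} = 1$,   $a_{13} = a_{14} = 0$.

Let

(5.11) $\rho = [(x'', x'') - 1]^{-1/2}$.

Then (5.4) and (5.6) yield

$\xi_2' = -\xi_1 + \dfrac{1}{\rho}\,\xi_3$.

Consequently

$a_{23} = (\xi_2', \xi_3) = \dfrac{1}{\rho}$   and   $a_{24} = 0$.

It remains to find the value of $a_{34} = (\xi_3', \xi_4) = -(\xi_4', \xi_3)$. Since

$\xi_3 = \rho(x + x'')$,

$\xi_3' = \rho'x + \rho x' + \rho'x'' + \rho x'''$.

Scalar multiplication of $\xi_3'$ by $\xi_4 = \rho z$ yields

$(\xi_3', \xi_4) = \rho\rho'(x, z) + \rho^2(x', z) + \rho\rho'(x'', z) + \rho^2(x''', z)$.

But z is orthogonal to x, x' and x''; therefore

(5.12) $(\xi_3', \xi_4) = \rho^2(x''', z) = \dfrac{\Delta}{1 - (x'', x'')}$,

where

(5.13) $\Delta = \begin{vmatrix} x_0 & x_1 & x_2 & x_3 \\ x_0' & x_1' & x_2' & x_3' \\ x_0'' & x_1'' & x_2'' & x_3'' \\ x_0''' & x_1''' & x_2''' & x_3''' \end{vmatrix}$.

Let

(5.14) $\dfrac{1}{\tau} = \dfrac{\Delta}{1 - (x'', x'')}$.

The equations (5.9) then become

(5.9*)
$\xi_1' = \xi_2$,
$\xi_2' = -\xi_1 + \dfrac{1}{\rho}\,\xi_3$,
$\xi_3' = -\dfrac{1}{\rho}\,\xi_2 + \dfrac{1}{\tau}\,\xi_4$,
$\xi_4' = -\dfrac{1}{\tau}\,\xi_3$.

This system of equations is the analogue for a series $\mathfrak{S}$ under $\mathfrak{G}_6$ of the Serret-
Frenet formulae for a curve in Euclidean space. We shall call $1/\rho$ and $1/\tau$ the
$\mathfrak{G}_6$-curvature and $\mathfrak{G}_6$-torsion of $\mathfrak{S}$ respectively. Given two arbitrary functions
$\rho(\sigma)$ and $\tau(\sigma)$, a series is determined within a whirl-rotation by means of (5.9*).
We can therefore regard $\rho = \rho(\sigma)$ and $\tau = \tau(\sigma)$ as the intrinsic equations of a
series relative to $\mathfrak{G}_6$.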

If $\mathfrak{S}$ is a turbine, its parametric equation (see (2.8)) may be expressed in
the form

$x = q(-\cos t + r\sin t)$,   $q\bar{q} = 1$,   $r^2 = -1$,

where q and r are constant quaternions. Since $(dx/dt)(d\bar{x}/dt) = 1$, $t = \pm\sigma$
+ const. Consequently, if x(t) is a turbine,

$(x', x') = 1$   and   $(x'', x'') - 1 = 0$,

where x' and x'' denote differentiation with respect to $\sigma$. Conversely, it can be
shown that a series $\mathfrak{S}$: $x(\sigma)$, such that

(5.15) $N(y) = (x'', x'') - 1 = 0$,

is a turbine. For (5.15) implies that $y = x'' + x = 0$. Therefore $\bar{x}x'' + \bar{x}x = 0$.
Consequently $\bar{x}x'' = -1$. But the fixed centrode of $\mathfrak{S}$ is $\pm Z = \pm\bar{x}x'$.
Therefore

$Z' = \bar{x}x'' + \bar{x}'x' = 0$,

which implies that the fixed centrode of $\mathfrak{S}$ is a pair of diametrically opposite
points and, consequently, that $\mathfrak{S}$ is a turbine. Thus (5.15) is a necessary and
sufficient condition that a series be a turbine; or, what is equivalent, a necessary
and sufficient condition that $\mathfrak{S}$ be a turbine is that $1/\rho = 0$. Torsion is not
defined for turbines.

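Reverting to the original parameter t, we find the following expressions for
the differential invariants:

(5.14*) $\dfrac{1}{\rho^2} = w\left(\dfrac{dx}{dt}, \dfrac{dx}{dt}\right)^{-3}$,   $\dfrac{1}{\tau} = -\dfrac{\Delta_t}{w}$,

where

$w = \left(\dfrac{dx}{dt}, \dfrac{dx}{dt}\right)\left(\dfrac{d^2x}{dt^2}, \dfrac{d^2x}{dt^2}\right) - \left(\dfrac{dx}{dt}, \dfrac{d^2x}{dt^2}\right)^2 - \left(\dfrac{dx}{dt}, \dfrac{dx}{dt}\right)^3$,

$\Delta_t = \left|\, x_0 \;\; \dfrac{dx_1}{dt} \;\; \dfrac{d^2x_2}{dt^2} \;\; \dfrac{d^3x_3}{dt^3} \,\right|$.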
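The curvature can be evaluated directly from derivatives with respect to an arbitrary parameter. The sketch below assumes the chain-rule form 1/ρ² = [(ẋ, ẋ)(ẍ, ẍ) − (ẋ, ẍ)² − (ẋ, ẋ)³] / (ẋ, ẋ)³, where dots denote d/dt; this expression follows from (5.11) and is offered as a reading of the formula, not as a quotation of it. The sample family of series on the unit 3-sphere, and the names `curvature_sq` and `sample`, are illustrative.

```python
import math

def dot4(x, y):
    """Scalar product (x, y) computed componentwise."""
    return sum(p * q for p, q in zip(x, y))

def curvature_sq(dx, ddx):
    """1/rho^2 from the first and second t-derivatives (assumed form)."""
    e, f, g = dot4(dx, dx), dot4(dx, ddx), dot4(ddx, ddx)
    return (e * g - f * f - e ** 3) / e ** 3

def sample(t, alpha, m):
    """Derivatives of the normalized series
    x(t) = (cos a cos t, cos a sin t, sin a cos mt, sin a sin mt)."""
    ca, sa = math.cos(alpha), math.sin(alpha)
    dx = (-ca * math.sin(t), ca * math.cos(t),
          -m * sa * math.sin(m * t), m * sa * math.cos(m * t))
    ddx = (-ca * math.cos(t), -ca * math.sin(t),
           -m * m * sa * math.cos(m * t), -m * m * sa * math.sin(m * t))
    return dx, ddx

# m = 1 gives a linear series (a turbine); m = 2 does not.
turbine = curvature_sq(*sample(0.3, math.pi / 4, 1))
twisted = curvature_sq(*sample(0.3, math.pi / 4, 2))
```

For m = 1 the series is a linear combination of two fixed quaternions, hence a turbine, and the curvature vanishes; for m = 2 and α = π/4 the same formula gives 8.5/2.5² − 1 = 0.36, independent of t.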

6. Kinematic and non-Euclidean interpretations of the differential invariants
$\rho$ and $\tau$. Let the displacement $\mathcal{D}$, defined by a series $\mathfrak{S}$: $x(\sigma)$, have
$\pm Z(\sigma) = \pm\bar{x}x'$ for its fixed centrode. We assume that $\mathfrak{S}$ is not a turbine,
that is, that $(x'', x'') - 1 \ne 0$. Let the unit vector

(6.1) $\zeta_1 = \bar{x}x'$

have its initial point at O, the centre of the fixed sphere $S_f$. Let s be the
Euclidean arc-length of the fixed centrode $Z(\sigma)$ measured from $\sigma_0$ to $\sigma$. Then

$ds^2 = (Z', Z')\,d\sigma^2 = (\zeta_1', \zeta_1')\,d\sigma^2$.

But $Z' = \bar{x}x'' + \bar{x}'x' = 1 + \bar{x}x''$. Therefore

$ds^2 = [(x'', x'') - 1]\,d\sigma^2 = \dfrac{1}{\rho^2}\,d\sigma^2$.

We orient $Z(\sigma)$ so that $ds/d\sigma = 1/\rho > 0$. Now

$\dfrac{d\zeta_1}{ds} = (\bar{x}x'' + \bar{x}'x')\rho = \rho\bar{x}(x + x'')$.

Let

(6.2) $\zeta_2 = \dfrac{d\zeta_1}{ds} = \rho\bar{x}(x + x'')$.

Evidently $\zeta_2$ is a unit vector tangent to the fixed centrode at the point $\sigma$.
Let $\zeta_3$ be the vector product of $\zeta_1$ and $\zeta_2$. Then

(6.3) $\zeta_3 = \zeta_1 \times \zeta_2 = \zeta_1\zeta_2$.

Therefore $\zeta_3$ is a unit vector tangent to $S_f$ and orthogonal to $\zeta_1$ and $\zeta_2$. The
three orthogonal unit vectors $\zeta_i$ constitute a moving trihedral of the fixed
centrode considered as a spherical curve. The vectors $\zeta_i'$ (i = 1, 2, 3) being
linearly dependent on the $\zeta_i$, we can let

(6.4) $\zeta_i' = \beta_{i1}\zeta_1 + \beta_{i2}\zeta_2 + \beta_{i3}\zeta_3$   (i = 1, 2, 3; $\beta_{ij}$ real scalars).

Evidently the (Euclidean) scalar product $\zeta_i \cdot \zeta_j$ of the vectors $\zeta_i$, $\zeta_j$ is equal to
$(\zeta_i, \zeta_j)$. Since $\zeta_i \cdot \zeta_j = \delta_{ij}$,

$\zeta_i' \cdot \zeta_j + \zeta_i \cdot \zeta_j' = 0$.

Hence $\|\beta_{ij}\|$ is skew-symmetric. From (6.2) we obtain

$\beta_{12} = -\beta_{21} = \dfrac{1}{\rho}$.

We proceed to evaluate $\beta_{23} = \zeta_2' \cdot \zeta_3 = (\zeta_2', \zeta_3)$. Since $\zeta_1$ and $\zeta_2$ are orthogonal
vectors, (6.1), (6.2), and (6.3) yield

$\zeta_3 = \zeta_1\zeta_2 = \rho\bar{x}x'\bar{x}(x + x'')$.

Observing that $\bar{x}x'\bar{x}x' = -1$, we obtain

$\zeta_3 = -\rho(\bar{x}'x'' + \bar{x}'x)$.

Moreover,

$\zeta_2' = \rho'\bar{x}(x + x'') + \rho(\bar{x}'x'' + \bar{x}x''')$

and, since $(\zeta_2, \zeta_3) = 0$,

$\zeta_2' \cdot \zeta_3 = (\zeta_2', \zeta_3) = \rho(\bar{x}'x'' + \bar{x}x''', \zeta_3)$.

This can be reduced, by virtue of (2.4), (5.2), and (5.11), to

$-1 - \rho^2[(\bar{x}x''', \bar{x}'x'') + (\bar{x}x''', \bar{x}'x)]$.

To evaluate the scalar products in the square brackets, write

$p_{ij} = x_i x_j''' - x_j x_i'''$,   $q_{ij} = x_i' x_j'' - x_j' x_i''$,   $r_{ij} = x_i' x_j - x_j' x_i$.








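Observing that, by reason of (5.5), $(x, x''') = 0$ and $(x', x''') = -(x'', x'')$, we obtain

$\bar{x}x''' = e_1(p_{01} - p_{23}) + e_2(p_{02} - p_{31}) + e_3(p_{03} - p_{12})$,
$\bar{x}'x'' = e_1(q_{01} - q_{23}) + e_2(q_{02} - q_{31}) + e_3(q_{03} - q_{12})$.

Therefore

$(\bar{x}x''', \bar{x}'x'') = \sum p_{ij}q_{ij} - \sum p_{ij}q_{kl}$,

where i, j and k, l are complementary pairs of subscripts. But

$\sum p_{ij}q_{ij} = \begin{vmatrix} (x, x') & (x, x'') \\ (x''', x') & (x''', x'') \end{vmatrix} = \begin{vmatrix} 0 & -1 \\ -(x'', x'') & (x'', x''') \end{vmatrix} = -(x'', x'')$,

$\sum p_{ij}q_{kl} = \Delta$.

Therefore

$(\bar{x}x''', \bar{x}'x'') = -(x'', x'') - \Delta$.

Similarly

$(\bar{x}x''', \bar{x}'x) = \sum p_{ij}r_{ij} - \sum p_{ij}r_{kl}$.

But

$\sum p_{ij}r_{ij} = \begin{vmatrix} (x, x') & (x, x) \\ (x''', x') & (x''', x) \end{vmatrix} = \begin{vmatrix} 0 & 1 \\ -(x'', x'') & 0 \end{vmatrix} = (x'', x'')$,

$\sum p_{ij}r_{kl} = 0$.

Hence $(\bar{x}x''', \bar{x}'x) = (x'', x'')$. Consequently

$\beta_{23} = -\beta_{32} = \rho^2\Delta - 1 = -1 - \dfrac{1}{\tau}$.

Substituting the expressions we have found for the $\beta_{ij}$ in (6.4), we obtain

(6.4*)
$\zeta_1' = \dfrac{1}{\rho}\,\zeta_2$,
$\zeta_2' = -\dfrac{1}{\rho}\,\zeta_1 - \left(1 + \dfrac{1}{\tau}\right)\zeta_3$,
$\zeta_3' = \left(1 + \dfrac{1}{\tau}\right)\zeta_2$.

If we change from the parameter $\sigma$ to s (Euclidean arc-length), the system
(6.4*) becomes

(6.4**)
$\dfrac{d\zeta_1}{ds} = \zeta_2$,
$\dfrac{d\zeta_2}{ds} = -\zeta_1 - \rho\left(1 + \dfrac{1}{\tau}\right)\zeta_3$,
$\dfrac{d\zeta_3}{ds} = \rho\left(1 + \dfrac{1}{\tau}\right)\zeta_2$.

The system (6.4**) is the set of Serret-Frenet formulae of Z(s) regarded as a
spherical curve.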

Evidently $K_f = -\rho(1 + 1/\tau)$ is the geodesic curvature of the fixed centrode,
and represents the rate of turning (bending) of the plane tangent to the fixed
(space) cone as its point of tangency with the fixed centrode moves on it with
unit speed. If R is the radius of curvature of Z(s) regarded as a space curve,
$R^{-2} = K_f^2 + 1$.

In a similar manner, the Serret-Frenet formulae for the mobile centrode
$\pm W(\sigma)$ on $S_m$ can be found. Letting $\eta_1$ be the unit vector $W(\sigma) = x'\bar{x}$ at the
point $\sigma$, $\eta_2$ the unit vector tangent to $W(\sigma)$ at $\sigma$, and $\eta_3 = \eta_1 \times \eta_2 = \eta_1\eta_2$, we
obtain

(6.5)
$\dfrac{d\eta_1}{ds} = \eta_2$,
$\dfrac{d\eta_2}{ds} = -\eta_1 + \rho\left(1 - \dfrac{1}{\tau}\right)\eta_3$,
$\dfrac{d\eta_3}{ds} = -\rho\left(1 - \dfrac{1}{\tau}\right)\eta_2$,

where s is the Euclidean arc-length of the mobile centrode. The geodesic
curvature of the mobile centrode is $K_m = \rho(1 - 1/\tau)$.

The geodesic curvatures $K_f(s)$ and $K_m(s)$ determine the fixed and mobile
centrodes on $S_f$ and $S_m$ respectively within rotations around O. Hence the
differential invariants $\rho(\sigma)$ and $\tau(\sigma)$ which determine a series $\mathfrak{S}$ within a whirl-
rotation also determine, within rotations, the fixed and mobile centrodes of the
continuous motion $\mathcal{D}$ defined by $\mathfrak{S}$. When $K_f$ and $K_m$ are constant, the two
centrodes are circles, and the motion $\mathcal{D}$ becomes that of a circle of radius
$(1 + K_m^2)^{-1/2}$ on $S_m$ rolling without slipping on a circle of radius $(1 + K_f^2)^{-1/2}$ on $S_f$.
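For constant invariants the two rolling circles follow from ρ and τ by direct arithmetic. The sketch below assumes the expressions for the geodesic curvatures stated in the text, K_f = −ρ(1 + 1/τ) and K_m = ρ(1 − 1/τ), and the radius (1 + K²) raised to the power −1/2 for a circle of geodesic curvature K on the unit sphere; the function name and the sample values are illustrative.

```python
import math

def centrode_circles(rho, tau):
    """Geodesic curvatures and Euclidean radii of the fixed and mobile
    centrodes for constant rho and tau (tau must be non-zero)."""
    k_f = -rho * (1.0 + 1.0 / tau)   # fixed centrode
    k_m = rho * (1.0 - 1.0 / tau)    # mobile centrode

    def radius(k):
        # Radius of a circle of geodesic curvature k on the unit sphere.
        return 1.0 / math.sqrt(1.0 + k * k)

    return k_f, k_m, radius(k_f), radius(k_m)

# Hypothetical constant invariants.
k_f, k_m, r_f, r_m = centrode_circles(1.0, -1.0)
```

With ρ = 1 and τ = −1 the fixed centrode has zero geodesic curvature, so it is a great circle of radius 1, while the mobile centrode has K_m = 2 and radius 5 to the power −1/2.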

If we regard the four components $x_i$ of the quaternion x as the homogeneous
coordinates of a point in projective three-space, we obtain a continuous one-to-
one mapping of the lineal elements x on S upon the points x in three-space. By
virtue of this mapping it is evident that to the $\infty^6$ whirl-rotations (2.3) on S
correspond the $\infty^6$ displacements in elliptic space $E_3$; indeed, to the whirls
correspond the left-translations in $E_3$ and to the rotations correspond the
right-translations. A series $\mathfrak{S}$ as in (5.1) is mapped on a curve $\mathcal{C}$ in $E_3$,
turbines being mapped on the straight lines. If $\mathfrak{S}$ is not a turbine, the moving
frame of lineal elements $\xi_i$ (i = 1, 2, 3, 4) associated with $\mathfrak{S}$ is mapped on a frame
of four points associated with $\mathcal{C}$. The invariants $1/\rho$ and $1/\tau$ can be interpreted
as the elliptic curvature and torsion respectively of $\mathcal{C}$, and the equations
(5.9*) become the Serret-Frenet formulae for a curve in $E_3$. A continuous
motion of $S_m$ over $S_f$ which corresponds within a whirl-rotation to a series $\mathfrak{S}$ on
S therefore also corresponds within an elliptic displacement to a curve $\mathcal{C}$ in $E_3$.








REFERENCES

1. W. Blaschke, Euklidische Kinematik und nichteuklidische Geometrie, Z. Math. Phys., vol. 60 (1911), 61-91 and 203-204.
2. W. Blaschke, Ebene Kinematik (Leipzig and Berlin, 1938).
3. J. DeCicco, The geometry of whirl series, Trans. Amer. Math. Soc., vol. 43 (1938), 344-358.
4. J. DeCicco, The differential geometry of series of lineal elements, Trans. Amer. Math. Soc., vol. 46 (1939), 348-361.
5. J. M. Feld, The geometry of whirls and whirl-motions in space, Bull. Amer. Math. Soc., vol. 47 (1941), 927-933.
6. J. M. Feld, Whirl-similitudes, Euclidean kinematics, and non-Euclidean geometry, Bull. Amer. Math. Soc., vol. 48 (1942), 783-790.
7. J. M. Feld, On a representation in space of groups of circle and turbine transformations in the plane, Bull. Amer. Math. Soc., vol. 50 (1944), 930-934.
8. J. M. Feld, A kinematic characterization of series of lineal elements in the plane and of their differential invariants under the group of whirl-similitudes and some of its subgroups, Amer. J. Math., vol. 70 (1948), 129-138.
9. J. Grünwald, Ein Abbildungsprinzip, welches die ebene Geometrie und Kinematik mit der räumlichen Geometrie verknüpft, S.B. Akad. Wiss. Wien, IIa, vol. 80 (1911), 677-741.
10. E. Kasner, The group of turns and slides and the geometry of turbines, Amer. J. Math., vol. 33 (1911), 193-202.
11. E. Kasner and J. DeCicco, The geometry of turbines, flat fields and differential equations, Amer. J. Math., vol. 59 (1937), 545-563.
12. E. Kasner and J. DeCicco, The geometry of the whirl-motion group G6: elementary invariants, Bull. Amer. Math. Soc., vol. 43 (1937), 399-403.
13. A. Narasinga Rao, Studies in turbine geometry I, J. Indian Math. Soc., vol. 3 (1938), 96-108; II, Proc. Indian Acad. Sc., vol. 8A (1938), 179-186.
14. G. Scheffers, Isogonalkurven, Aequitangentialkurven und komplexe Zahlen, Math. Ann., vol. 60 (1905), 491-531.
15. K. Strubecker, Zur Geometrie sphärischer Kurvenscharen, Jber. dtsch. MatVer., vol. 44 (1934), 184-198.

Queens College
Flushing, N.Y.


ON THE PROPERTY C AND A PROBLEM OF 
HAUSDORFF 


FRITZ ROTHBERGER 


1. Introduction. In an earlier paper [3] I studied the property C and
related properties C' and C''; but the principal problem, viz., to prove, with the
axiom of choice only (without any other hypothesis), the existence of a non-
denumerable set of property C, remains open.

In another paper [4] I studied Hausdorff's problem [1] of the existence of
Ω-limits for (transfinite) sequences of dyadic sequences, and obtained some
conditional results; but again the main problem remains open, viz., the problem
of proving (with the axiom of choice only) the existence of such Ω-limits.

In the present paper we are going to solve, in a certain sense, a compound of
these two problems. We are going to show that either there exist Ω-limits, or
non-denumerable C-sets, or both (Theorem I). We also prove two other
theorems which are related.

For the definitions and the general theory we refer the reader to the two 
papers mentioned above. We shall, however, repeat here those theorems which 
we are going to use explicitly, and those definitions where more than just the 
name occurs. 

We denote generically a finite set by A. Individual finite sets will be indicated
with a superscript, such as A', A*. If E ⊂ F + A, we shall write E < F (E is
almost-contained in F). Whereas, in [4], these definitions were used for sets of
natural numbers only, we shall use them here for other sets also, but only for
subsets of a fixed denumerable set (e.g., the set of all rational numbers) and thus
we shall still have the same theorems, mutatis mutandis.

A set E is said to have the property C'' if every double sequence of intervals
$J_{mn}$ satisfying the conditions

$E \subset \sum_{n} J_{mn}$   (for all m),

contains a diagonal sequence

$J_{1 n_1}, J_{2 n_2}, \ldots, J_{m n_m}, \ldots$

such that

$E \subset \sum_{m} J_{m n_m}$.

It can easily be shown that every C''-set is a C-set [cf. 2].


THEOREM I. The non-existence of Ω-limits (for dyadic sequences) implies that
every linear set of power $\aleph_1$ has property C (and also C'').

Received June 20, 1950.

We shall actually prove it for C'' and we shall give two proofs.

THEOREM II. The non-existence of Ω-limits implies the following proposition:
Given any family of $\aleph_1$ infinite sets of natural numbers $E^\alpha$ ($\alpha$ < Ω), there exists a
set D such that $E^\alpha \cdot D$ and $E^\alpha \cdot CD$ are infinite sets, for all $\alpha$. (CD means: complement
of D.)

There is a stronger theorem, from which the above two follow, namely:

THEOREM III. The non-existence of Ω-limits implies the following proposition:
The sum of $\aleph_1$ (linear) sets of first category is again of first category.


For the proofs of these theorems we need a few preliminaries. 

The abbreviation "of n.n." means "of natural numbers." The letters $\mu$, $\nu$,
$m$, $n$, $r$, $s$, $t$ (without or with subscripts) will always denote natural numbers;
and the letters $a$, $b$, $c$, $d$ (without or with subscripts) will denote "segments,"
to be defined presently. Sets of segments, and also other sets, will be denoted
by capitals, $A$, $B$, ....

A finite sequence of n.n. $(r_1, r_2, \ldots, r_n)$ will be called a segment. The first $m$
terms ($m \le n$) of a segment form a subsegment. Example: (1, 3, 5) is a subsegment
of (1, 3, 5, 7, 9), but (1, 5, 9) is not a subsegment of it in our sense.

The word "sequence" shall always mean "infinite sequence."

Two sequences of n.n., $\{s_n\}$ and $\{t_n\}$, will be said to intersect if we have $s_n = t_n$
for some value of $n$. (In [3, p. 118, Lemme 5], two sequences were said to be
"tout-à-fait différentes" if and only if, in this sense, they do not intersect.)

A sequence $\{s_n\}$ and a segment $(r_1, r_2, \ldots, r_m)$ intersect, if $s_n = r_n$ for some $n \le m$.

For a given set $\mathcal{S}$ of sequences of n.n., a diagonal sequence is a sequence (not
necessarily in $\mathcal{S}$) which intersects each element of $\mathcal{S}$. If such a sequence exists,
we shall say that $\mathcal{S}$ admits a diagonal.
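The definitions of subsegment and of intersection can be illustrated by a small sketch. The function names are hypothetical (they do not appear in the paper), and an infinite sequence is modelled, as an assumption, by a function `n -> s_n` with `n = 1, 2, ...`.

```python
def is_subsegment(a, b):
    """True if segment a consists of the first len(a) terms of segment b."""
    return len(a) <= len(b) and tuple(b[:len(a)]) == tuple(a)


def segment_meets_sequence(segment, seq):
    """True if the segment and the sequence agree at some common index.

    `seq` is a function n -> s_n (n = 1, 2, ...), a stand-in for an infinite
    sequence; only its first len(segment) terms are inspected.
    """
    return any(r == seq(n) for n, r in enumerate(segment, start=1))


# The example given in the text: (1, 3, 5) is an initial part of
# (1, 3, 5, 7, 9), whereas (1, 5, 9) is not.
example_ok = is_subsegment((1, 3, 5), (1, 3, 5, 7, 9))
example_bad = is_subsegment((1, 5, 9), (1, 3, 5, 7, 9))
```

A segment meets a sequence only through equal terms at equal positions, which is why the test compares `r` with `seq(n)` index by index.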

We quote five theorems from the other papers, for later use: 


(1) [3, p. 119, Lemme 6]. The proposition "Every linear set of power $\aleph_1$ has
property $C''$" is equivalent to the following: "Every set of sequences (of n.n.) of power
$\aleph_1$ admits a diagonal."


(2) [3, p. 120, Lemme 8]. The existence of a non-$C''$ set of power $\aleph_1$ implies
that the interval (0, 1) is the sum of $\aleph_1$ sets of first category.


(3) [4, p. 34, Theorem 3°]. The non-existence of $\Omega$-limits implies the proposition
$B(\aleph_1)$, i.e., the non-existence of $(\Omega, \omega^*)$-gaps.


(4) [4, p. 37, Lemma 5]. $B(\aleph_1)$ is equivalent to the following proposition: "If
$Y_n < X_\alpha$ for all $n < \omega$, $\alpha < \Omega$ (X's and Y's are sets of n.n.), then there exists a
set $D$ such that $Y_n < D < X_\alpha$ for all $n$ and $\alpha$."


(5) [4, p. 38, Lemma 7]. The non-existence of $\Omega$-limits (for dyadic sequences)
implies the following proposition: "Given $\aleph_1$ sets (of n.n.) $X_\alpha$, if every finite product
of X's is an infinite set, then there exists an infinite set $D$, such that $D < X_\alpha$ (for
all $\alpha$)." ("Finite product" means a product of a finite number of sets.)




The last two theorems, i.e., (4) and (5), are the clue to the proofs of this
paper; they also contain, in a sense, the clue to [4, Chapter III]. We shall,
however, have to replace, in (4) and (5), the words "of n.n." by "of segments"
and later on by "of rational numbers." This is permissible, since, in theorems
of this type, the set of all natural numbers may be replaced by any other denumerable
set, e.g., the set of all segments.


(4'), (5'). Same as (4), (5), with "segments" or "rational numbers" in place
of "n.n."


2. Proof of Theorem I. We need the following two new lemmas, which are 
obvious: 


LEMMA (i). Given a finite set of sequences (of n.n.), there exist infinitely many 
diagonal segments, a diagonal segment being a segment intersecting each of the given 
sequences. 


LEMMA (ii). Given a segment $b = (r_1, r_2, \ldots, r_m)$ and a finite set of sequences,
there exist infinitely many diagonal segments starting with $(r_1, r_2, \ldots, r_m)$, i.e.,
having $b$ as a subsegment.
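One way to see why Lemmas (i) and (ii) are obvious is the following sketch; it is an illustration under stated assumptions, not the author's argument. Sequences are modelled by functions `n -> s_n`, and the names `diagonal_extension`, `filler` and `extra` are hypothetical. Each given sequence is met by copying its own term at one fresh position beyond $b$; Lemma (i) is the case of the empty segment.

```python
def diagonal_extension(b, sequences, filler=1, extra=0):
    """Extend segment b to a segment that intersects every given sequence.

    Each sequence is a function n -> s_n (n = 1, 2, ...).  The j-th sequence
    is met at position len(b) + j, where its own term is copied.  Appending
    `extra` further terms equal to `filler` yields a different diagonal
    segment for each value of `extra`, hence infinitely many of them.
    """
    segment = list(b)
    for s in sequences:
        position = len(segment) + 1      # next free (1-based) position
        segment.append(s(position))      # agree with s at that position
    segment.extend([filler] * extra)
    return tuple(segment)
```

For instance, with `b = (5, 5)` and the three sequences `n`, `n*n` and the constant 7, the call returns `(5, 5, 3, 16, 7)`, which begins with `b` and agrees with each sequence at one of the positions 3, 4, 5.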


Proof of Theorem I. Let

$$\{s_n^{\alpha}\} \quad (n = 1, 2, \ldots;\ \alpha < \Omega)$$

be a fixed, but arbitrary, set of $\aleph_1$ sequences of n.n.
Assuming that no $\Omega$-limits exist, it is sufficient to show that the above set
admits a diagonal sequence, cf. (1).
Let $A^{\alpha}$ be the set of all segments intersecting $\{s_n^{\alpha}\}$. It follows from Lemma (i)
that every finite product

$$\prod_{\nu=1}^{n} A^{\alpha_\nu}$$

is an infinite set, hence, by (5'), there exists an infinite set (of segments) $D_0$ such
that $D_0 < A^{\alpha}$ (all $\alpha < \Omega$).

More generally, let $A_b^{\alpha}$ be the set of all those segments which have $b$ as a
(proper) subsegment and intersect $\{s_n^{\alpha}\}$. Just as before, but using Lemma (ii),
we see that every finite product is an infinite set, hence there exists an infinite
set $D_b$ such that $D_b < A_b^{\alpha}$ (all $\alpha$).

Since obviously $A_b^{\alpha} < A^{\alpha}$, we have $D_b < A^{\alpha}$, for all $\alpha$ and all segments $b$. Now,
the $b$'s form a denumerable set, and the $\alpha$'s a set of power $\aleph_1$, hence, by (3) and
(4'), there exists a set $D$ such that


(6) $D_b < D < A^{\alpha}$, for all $\alpha$ and all $b$.


We shall use this set $D$ to construct the required diagonal sequence.
Let $b_1 \in D$. Next, let

$$b_2 \in D_{b_1} \cdot D.$$

(Such a segment exists, because $D_b \cdot D$ is an infinite set for any $b$.) Note that $b_1$
is a subsegment of $b_2$. Next, let

$$b_3 \in D_{b_2} \cdot D,$$

and, generally, let

$$b_{n+1} \in D_{b_n} \cdot D.$$

We see that $b_n$ is always a (proper) subsegment of $b_{n+1}$, hence all $b_n$'s are subsegments
of one common sequence. More explicitly, we can write:

$$b_1 = (r_1, r_2, \ldots, r_{m_1}),$$
$$b_2 = (r_1, r_2, \ldots, r_{m_1}, \ldots, r_{m_2}),$$
$$\ldots$$

In order to show that $\{r_n\}$ is the required diagonal sequence, it is sufficient to
notice that, by definition, $b_n \in D$ for all $n$, and that, by (6), $D < A^{\alpha}$ for all $\alpha$. Thus
$b_n \in A^{\alpha}$ for any given $\alpha$ and almost all $n$. Therefore, for any given $\alpha$, almost all $b_n$
intersect $\{s_n^{\alpha}\}$, and hence $\{r_n\}$ intersects $\{s_n^{\alpha}\}$ for all $\alpha$.

Theorem II can be proved in a similar way, but we shall rather deduce it from 
Theorem III. 


3. Proof of Theorem III. We need the following result: 


(7) [3, p. 112, Théorème 1,B;3]. $B(\aleph_1)$ is equivalent to the following proposition:
"The sum of $\aleph_1$ $F_\sigma$'s disjoint from $R$ is contained in an $F_\sigma$ disjoint from $R$, where $R$
is the set of all rational numbers."


Now, the set $R$ may be replaced by any other dense denumerable set $D$;
also, the sum of $\aleph_1$ $F_\sigma$'s is equal to the sum of $\aleph_1$ closed sets (because $\aleph_1 \cdot \aleph_0
= \aleph_1$). From this, together with (3) and (7), we have the following:

LEMMA (iii). The non-existence of $\Omega$-limits implies the proposition: "The sum of
$\aleph_1$ closed sets disjoint from $D$ is contained in an $F_\sigma$ disjoint from $D$, where $D$ is
any everywhere-dense denumerable set."


Taking complements, we get the following: 


LEMMA (iv). The non-existence of $\Omega$-limits implies that the product of $\aleph_1$ open
sets or $G_\delta$'s containing $D$, contains a $G_\delta$ containing $D$ (where $D$ is everywhere
dense and denumerable).


Proof of Theorem III. Without loss of generality, the sets of first category in
the proposition in the theorem may be replaced by $F_\sigma$'s of first category, i.e., by
non-dense $F_\sigma$'s. Then, by the same argument as above, the proposition may be
changed to the following one:


The product of $\aleph_1$ everywhere-dense open sets contains an everywhere-dense $G_\delta$.




Let $G^{1}, G^{2}, \ldots, G^{\omega}, \ldots, G^{\alpha}, \ldots$ ($\alpha < \Omega$) be $\aleph_1$ everywhere-dense open sets.
It is sufficient to prove that they contain an everywhere-dense $G_\delta$, assuming the
non-existence of $\Omega$-limits.

Let $A^{\alpha}$ be the set of all rational numbers contained in $G^{\alpha}$. Since every finite
product of $G^{\alpha}$'s is again an open set, every finite product of $A^{\alpha}$'s is an infinite set.
Hence, by (5'), there exists an infinite set $D_0$ with $D_0 < A^{\alpha}$ (for all $\alpha$).

Now let $J$ be any interval with rational endpoints. Then $J \cdot A^{\alpha}$ is the set of
rational points in $J \cdot G^{\alpha}$. Again, any finite product of these sets $J \cdot A^{\alpha}$ is an infinite
set. Hence there is an infinite set $D_J$ with $D_J < J \cdot A^{\alpha} < A^{\alpha}$, for all $\alpha$. Thus
we have:


(8) $D_J \subset J$, for all $J$'s,
(9) $D_J < A^{\alpha}$, for any $J$ and any $\alpha$. (There are $\aleph_0$ $J$'s and $\aleph_1$ $\alpha$'s.)


Therefore, by (3), (4'), and (9), there is a set $D$ with $D_J < D < A^{\alpha}$, for all
$J$ and $\alpha$.

Now, since $D_J$ is an infinite set, it has, by (8), at least one accumulation point
in the closure of $J$, and this accumulation point is necessarily an accumulation
point of $D$, because almost all¹ elements of $D_J$ are elements of $D$. Thus we see
that $D$ has accumulation points in every interval $J$, therefore $D$ is everywhere-dense
(and denumerable).

Also, since $D < A^{\alpha}$, we have $D \subset A^{\alpha} + \Delta^{\alpha}$, where the $\Delta^{\alpha}$'s are certain finite
subsets of $R$; and since $A^{\alpha} \subset G^{\alpha}$, we finally have


(10) $D \subset G^{\alpha} + \Delta^{\alpha}$, for all $\alpha$.


We may now apply Lemma (iv), for the left hand side of (10) is everywhere-dense
and denumerable, whereas the right hand side is a $G_\delta$ (because the sum of
an open set and a finite set is always a $G_\delta$).

It follows, from Lemma (iv), that there exists a $G_\delta$, say $E$, such that


(11) $D \subset E \subset \prod_{\alpha} (G^{\alpha} + \Delta^{\alpha}) \subset \prod_{\alpha} G^{\alpha} + R$.


From $D \subset E$ it follows that $E$ is everywhere-dense, therefore it is everywhere of
second category (because it is a $G_\delta$). Therefore, $E - R$ is still everywhere-dense
and is obviously still a $G_\delta$. Finally, we see from (11), that $E - R$ is
contained in all $G^{\alpha}$. Thus $E - R$ is the $G_\delta$ which we set out to find.


4. Proof of Theorem II. To every set of n.n. there corresponds a dyadic 
“decimal” representation of a real number belonging to the interval [0,1]. A set 
of sets of n.n. is said to be non-dense, or of first category, if the corresponding 
set of real numbers is non-dense, or of first category. Let E be an infinite 
set of n.n. Then the linear set corresponding to the set of X's such that $E \subset X$
is a Cantor discontinuum, and thus non-dense, and the set of X's such that
$E < X$ is of first category, being the sum of $\aleph_0$ non-dense sets.


¹ "Almost all" means "all but a finite number of."
















Similarly, the set of all X's such that $E < CX$ is also of first category. Hence,
the set of all X's such that


(12) either $E \cdot X = \Delta$ or $E \cdot CX = \Delta$ (with $\Delta$ a finite set),


is of first category. Therefore, given a set of $\aleph_1$ infinite sets $E^{\alpha}$ ($\alpha < \Omega$), and
assuming that there are no $\Omega$-limits, it follows from Theorem III that the set
of all X's such that


(13) for some $\alpha$, either $E^{\alpha} \cdot X = \Delta$ or $E^{\alpha} \cdot CX = \Delta$,

is likewise of first category. Hence the complement of this set of X's is not
empty (because of second category), so that there exists an infinite set $D$ (belonging
to the said complement and thus satisfying the negation of (13)), such
that


(14) for all $\alpha$, both $E^{\alpha} \cdot D$ and $E^{\alpha} \cdot CD$ are infinite sets,


which proves the theorem. 


5. Alternative proof of Theorem I. It follows from (2) (reversing the
implication) that, if the sum of $\aleph_1$ sets of first category is always also of first
category, then every set of power $\aleph_1$ has property $C''$. Combining this with
Theorem III, we have our theorem.


REFERENCES 


1. F. Hausdorff, Summen von $\aleph_1$ Mengen, Fund. Math., vol. 26 (1936), 247.

2. F. Rothberger, Eine Verschärfung der Eigenschaft C, Fund. Math., vol. 30 (1938), 54.

3. F. Rothberger, Sur les familles indénombrables de suites de nombres naturels et les problèmes
concernant la propriété C, Proc. Cambridge Philos. Soc., vol. 37 (1941), 109-126.

4. F. Rothberger, On some problems of Hausdorff and of Sierpinski, Fund. Math., vol. 35 
(1948), 29-46. 


University of New Brunswick 


A REMARK ON THE EXISTENCE OF A DENUMERABLE 
BASE FOR A FAMILY OF FUNCTIONS 


FRITZ ROTHBERGER 


A family $F$ of functions is said to have a denumerable base if there exists a
sequence of functions $\{f_n(x)\}$ (not necessarily $\in F$) such that any function
$f \in F$ is the limit of a subsequence of $\{f_n(x)\}$. The domain $X$ of a function
f(x) is the set of x’s for which f(x) is defined; we say f(x) is a function on X. 
A dyadic function is a function taking only the values 0 and 1. 

Let F be a family of dyadic functions on a set X. 

PROPOSITION (m, n). If $\overline{\overline{F}} = m$ and $\overline{\overline{X}} = n$, then the family $F$ has a denumerable
base.

In an earlier paper I have shown that the proposition $(\aleph_1, \aleph_1)$ is true [1,
p. 401, Theorem 3]. Hence, the continuum hypothesis implies the proposition
(c,c) [ibid., Corollary].

The problem is whether or not the proposition (c,c) can be proved inde- 
pendently (i.e., merely with the axiom of choice, but without any additional 
hypothesis such as the continuum hypothesis). We are going to prove a theorem 
which throws some light on this problem. 

First, we need two lemmas (proofs omitted): 


LEMMA A. If $n_1 > n_2$, then proposition $(m, n_1)$ implies proposition $(m, n_2)$.


LEMMA B. If $\aleph_\alpha < c$, then proposition (c,c) implies proposition $(c, \aleph_\alpha)$.


These will enable us to prove the following 


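THEOREM. If there exists an $\alpha$ and a $\beta$ such that

(1) $\aleph_\alpha < \aleph_\beta, \qquad \aleph_\beta \le c < \aleph_{\omega_\beta}, \qquad 2^{\aleph_\alpha} = \aleph_{\omega_\beta},$

then the proposition $(c, \aleph_\alpha)$ is false.
For example,

$$\alpha = 1,\quad \beta = 2,\quad \aleph_2 \le c < \aleph_{\omega_2} = 2^{\aleph_1}.$$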


Incidentally, the first relation in (1) is redundant: it follows from the third one 
by Koenig's theorem. 
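The redundancy can be spelled out in one line. This is a sketch only: it assumes the third relation of (1) is read as $2^{\aleph_\alpha} = \aleph_{\omega_\beta}$, and it writes cf for cofinality, a symbol the paper itself does not use. König's theorem gives the first inequality; the two facts used after it are that $\operatorname{cf}(\aleph_\lambda) = \operatorname{cf}(\lambda)$ for a limit ordinal $\lambda$, and that the cofinality of an ordinal never exceeds its power.

```latex
\aleph_\alpha
  \;<\; \operatorname{cf}\bigl(2^{\aleph_\alpha}\bigr)
  \;=\; \operatorname{cf}\bigl(\aleph_{\omega_\beta}\bigr)
  \;=\; \operatorname{cf}(\omega_\beta)
  \;\le\; \aleph_\beta .
```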
From this theorem, together with Lemma B, we have: 


COROLLARY 1. If $\aleph_\alpha$, $\aleph_\beta$ satisfying (1) exist, the proposition (c,c) is false.
We have, in particular (special cases):


Received June 20, 1950. 




COROLLARY 2. The proposition (c,c) is false if any one of the following propo- 
sitions (2), (3), (4), holds: 


(2) &: = K... and ®:<c <&,,,; 
(3) fs =, and &3<c¢ <B&,,; 
(4) al awe t.F .. dt. 


Note. In what follows, $\alpha$, $\beta$ are "constants"; $\gamma$, $\xi$ are "variables."


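Proof of Theorem. We assume (1), and we are going to construct a counterexample
for the proposition $(c, \aleph_\alpha)$. Let $G$ be the family of all dyadic functions
on $X$, where $\overline{\overline{X}} = \aleph_\alpha$. Then $\overline{\overline{G}} = 2^{\aleph_\alpha}$; hence, by (1),

(5) $\overline{\overline{G}} = \aleph_{\omega_\beta}$.

Given a sequence $\varphi = \{\varphi_n(x)\}$ of dyadic functions on $X$, let $F_\varphi$ be the family of
those functions which are limits of subsequences of $\{\varphi_n(x)\}$. Furthermore,
let $\Phi$ be the family of all sequences (of dyadic functions on $X$), and let $\mathcal{F}$ be
the system of all families $F_\varphi$ where $\varphi \in \Phi$.
It follows that


(6) $\overline{\overline{F_\varphi}} \le c, \qquad \overline{\overline{\Phi}} = (2^{\aleph_\alpha})^{\aleph_0} = 2^{\aleph_\alpha}$,

and

(6') $\overline{\overline{\mathcal{F}}} \le \overline{\overline{\Phi}}$ and $\overline{\overline{G}} \le \overline{\overline{\mathcal{F}}}$.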


(The last inequality follows from the fact that every element of $G$ corresponds
to a one-element family $F_\varphi$ whose base converges.)
Hence, from (1), (5), (6), and (6'), we have:


(7) $\overline{\overline{F_\varphi}} \le c < \aleph_{\omega_\beta}, \qquad \overline{\overline{G}} = \overline{\overline{\mathcal{F}}} = \aleph_{\omega_\beta}$.


Now, since $F_\varphi$ is the "maximal" family with the base $\varphi$, every family of
functions admitting a base is contained in some $F_\varphi$, i.e.,


(8) If $F$ has a base then, for some $\varphi$, $F \subset F_\varphi \in \mathcal{F}$.


On the assumption of (7) we are going to construct a family $F^{\circ}$ of power
$\le \aleph_\beta$ which is not contained in any $F_\varphi \in \mathcal{F}$, which therefore, according to
(8), has no base.

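Let

$$F_1, F_2, \ldots, F_\xi, \ldots \qquad (\xi < \omega_{\omega_\beta})$$

be the elements of $\mathcal{F}$ ordered in a transfinite sequence.
We put

(9) $H_\gamma = \sum_{\xi < \omega_\gamma} F_\xi \qquad (\gamma < \omega_\beta),$

and, for each $\gamma < \omega_\beta$, let $h_\gamma$ be one element of the set $G - H_\gamma$:

(10) $h_\gamma \in G - H_\gamma \qquad (\gamma < \omega_\beta).$

This element $h_\gamma$ exists because

$$\overline{\overline{H_\gamma}} \le c \cdot \aleph_\gamma < \aleph_{\omega_\beta},$$

but $\overline{\overline{G}} = \aleph_{\omega_\beta}$, hence the set $G - H_\gamma$ is not empty.

Now let $F^{\circ}$ be the set of all these $h_\gamma$. Then $\overline{\overline{F^{\circ}}} \le \aleph_\beta \le c$. It follows from
(10) that $h_\gamma \notin F_\xi$ for all $\xi < \omega_\gamma$. But $h_\gamma \in F^{\circ}$ by definition. Therefore $F^{\circ}$
cannot be contained in any $F_\xi$ (for all $\xi < \omega_{\omega_\beta}$), i.e., in any $F_\varphi \in \mathcal{F}$.

Thus, by (8), $F^{\circ}$ has no base, although $\overline{\overline{F^{\circ}}} \le c$.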


REFERENCE 


1. F. Rothberger, On families of real functions with a denumerable base, Ann. Math., vol. 45 
(1944), 397-406. 


University of New Brunswick 











AN EXTENSION OF MEYER’S THEOREM ON 
INDEFINITE TERNARY QUADRATIC FORMS 


BURTON W. JONES 


1. Introduction. Let $f$ be a ternary quadratic form whose matrix $F$ has
integral elements with g.c.d. 1, that is, an improperly or properly primitive
form according as all diagonal elements are even or not. Let $d$ be the determinant
of $f$ (denoted by $|f|$), $\Omega$ the g.c.d. of the 2-rowed minors of $F$. Then
$d = \Omega^2\Delta$ determines an integer $\Delta$. Two forms in the same genus have the same
invariants $\Omega$, $\Delta$, $d$. The form whose matrix is adj $F/\Omega$ is called the reciprocal
form of $f$. A theorem of Meyer, as extended by Dickson [1], who completely
reworked Meyer's inadequate proof, is the following:


THEOREM 1. If $f_1$ and $f_2$ are two properly or improperly primitive indefinite
ternary quadratic forms in the same genus, they are equivalent if
(1) $(\Omega, \Delta) \le 2$, $\Omega \not\equiv 0$ (mod 4), $\Delta \not\equiv 0$ (mod 4).

Meyer [3] also gave the number of classes in a genus of ternary indefinite forms
in terms of sets of quadratic characters with respect to the primes common to
$\Omega$ and $\Delta$, but his proofs are obscure. Siegel recently showed the author that
the forms

$$f = x_1^2 - 2x_2^2 + 64x_3^2, \qquad g = (2x_1 + x_3)^2 - 2x_2^2 + 16x_3^2$$

are in the same genus but are not equivalent since the latter represents no
perfect square whose factors are all congruent to 1 (mod 8). It is the purpose
of this article to give a large set of genera of one class whose invariants are not
relatively prime.
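As a consistency check on Siegel's example, the invariants $\Omega$, $\Delta$, $d$ of the two forms can be computed directly from their matrices. The sketch below assumes the forms read $f = x_1^2 - 2x_2^2 + 64x_3^2$ and $g = (2x_1 + x_3)^2 - 2x_2^2 + 16x_3^2$; the function names are hypothetical. Equal invariants are only a necessary condition for lying in the same genus, so this does not reproduce Siegel's argument.

```python
from itertools import combinations
from math import gcd


def det3(m):
    """Determinant of a 3x3 integer matrix."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def two_rowed_minors(m):
    """All nine 2x2 minors of a 3x3 matrix."""
    pairs = list(combinations(range(3), 2))
    return [m[r1][c1] * m[r2][c2] - m[r1][c2] * m[r2][c1]
            for r1, r2 in pairs for c1, c2 in pairs]


def invariants(m):
    """Return (Omega, Delta, d) with d = Omega^2 * Delta."""
    d = det3(m)
    omega = 0
    for minor in two_rowed_minors(m):
        omega = gcd(omega, abs(minor))
    assert d % (omega * omega) == 0      # holds for a primitive form
    return omega, d // (omega * omega), d


# Matrices of the two forms (off-diagonal entries are half the
# cross-term coefficients): g = 4x1^2 + 4x1x3 - 2x2^2 + 17x3^2.
F_MATRIX = [[1, 0, 0], [0, -2, 0], [0, 0, 64]]
G_MATRIX = [[4, 0, 2], [0, -2, 0], [2, 0, 17]]
```

Both matrices give $\Omega = 2$, $\Delta = -32$, $d = -128$, so $\Delta \equiv 0$ (mod 4) and condition (1) of Theorem 1 does not apply to this genus.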

Let $p$ be an odd prime factor common to $\Omega$ and $\Delta$. It is well known [2, Theorem
25] that for $k$ arbitrary, $f$ is equivalent to a form

(2) $f_0 \equiv a_1x_1^2 + p^2a_2x_2^2 + pa_3x_3^2 \pmod{p^k}, \qquad (a_1, p) = 1.$

Then the transformation $K$: $x_1 = py_1$, $x_2 = y_2$, $x_3 = y_3$, takes $f_0$ into $pg$
where $g$ is a form whose matrix has integral elements and

$$g \equiv pa_1y_1^2 + pa_2y_2^2 + a_3y_3^2 \pmod{p^{k-1}}.$$


We call g the related or p-related form of f and shall prove 


THEOREM 2. If a form $g$ above is in a genus of one class, if $p^3$ does not divide $|g|$,
and if there is an integer $q$, prime to $p$ and satisfying the following conditions:
(i) $|q|$ is an odd prime or double an odd prime;
(ii) $-q$ is represented by the reciprocal form of $g$;
(iii) every solution of the congruence

(3) $x^2 - qy^2 \equiv 1$ (mod $p$)

is congruent (mod $p$) to a solution of the Pell equation

(4) $x^2 - qy^2 = 1$;

then the form $f$ is in a genus of one class.


Notice that (ii) imposes only congruence conditions on $q$ and that $q$ must be
double a prime if the reciprocal of $g$ is improperly primitive.
Theorems 1 and 2 then imply 


COROLLARY 1. There is only one class in the genus of a (properly or improperly)
primitive form $f$ if
(i) $\Omega \not\equiv 0$ (mod 4), $\Delta \not\equiv 0$ (mod 4);
(ii) for any odd prime factor $p$ dividing both $\Omega$ and $\Delta$, it is true that $p^3$ does not
divide $|g|$ and there exists a $q$ satisfying the conditions of Theorem 2.


The conditions of Theorem 2 will be further considered in §4. 


2. Equivalence of $f_1$ and $f_2$ implies that of $g_1$ and $g_2$. We consider $f_1$ and $f_2$
two primitive forms of the same genus. Then [2, Theorem 40] we may assume
$f_1$ and $f_2$ congruent modulo an arbitrary power of $p$. Suppose $U = (u_{ij})$ is a
unimodular transformation (determinant $\pm 1$, integral elements) taking $f_1$ into
$f_2$, then

$$K^{-1}UK = \begin{pmatrix} u_{11} & u_{12}p^{-1} & u_{13}p^{-1} \\ pu_{21} & u_{22} & u_{23} \\ pu_{31} & u_{32} & u_{33} \end{pmatrix},$$

which is unimodular if $u_{12} \equiv u_{13} \equiv 0$ (mod $p$) and takes $g_1$ into $g_2$. Now $U$
takes $f_1$ into $f_2$, both of the form (2), which implies:

$$a_1(u_{11}x_1 + u_{12}x_2 + u_{13}x_3)^2 + pa_3(u_{31}x_1 + u_{32}x_2 + u_{33}x_3)^2 \equiv a_1x_1^2 + pa_3x_3^2 \pmod{p^2}.$$

This implies

$$a_1u_{12}^2 \equiv a_1u_{13}^2 \equiv 0 \pmod p$$

which, since $(a_1, p) = 1$, implies $u_{12} \equiv u_{13} \equiv 0$ (mod $p$) which completes our
proof that $f_1 \cong f_2$ implies $g_1 \cong g_2$ where $\cong$ is the sign for equivalence. Hence
the number of classes in the genus of $f$ is not less than the number of classes in
the genus of $g$.


3. Conditions under which $g_1 \cong g_2$ implies $f_1 \cong f_2$. As above, we may
assume $g_1$ and $g_2$ congruent modulo $p^k$. Now let the unimodular transformation
$U = (u_{ij})$ take $g_1$ into $g_2$. Then $KUK^{-1}$ takes $f_1$ into $f_2$,

$$KUK^{-1} = \begin{pmatrix} u_{11} & pu_{12} & pu_{13} \\ u_{21}p^{-1} & u_{22} & u_{23} \\ u_{31}p^{-1} & u_{32} & u_{33} \end{pmatrix},$$

and we need $u_{21} \equiv u_{31} \equiv 0$ (mod $p$). But

$$a_3(u_{31}x_1 + u_{32}x_2 + u_{33}x_3)^2 \equiv a_3x_3^2 \pmod p$$

follows from the fact that $U$ takes $g_1$ into $g_2$ and $g_1$ and $g_2$ are both in the form
mod $p^{k-1}$ given above. This implies $u_{31} \equiv u_{32} \equiv 0$ (mod $p$) since $a_3 \equiv 0$ (mod $p$)
would imply $p^3$ a divisor of $|g|$ contrary to hypothesis. It remains to make
$u_{21} \equiv 0$ (mod $p$). This we do by showing that under certain circumstances we
can find an automorph $P$ of $g$ such that the last two elements of the first column
of $PU$ are divisible by $p$.
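Write $G$, the matrix of $g$, in the form

$$G = \begin{pmatrix} pB & pb_1 \\ pb_1^T & b \end{pmatrix} \equiv \begin{pmatrix} pB & 0 \\ 0 & a_3 \end{pmatrix} \pmod{p^{k-1}}.$$

Since, under the conditions of Theorem 2, the reciprocal form of $g$ represents
$-q$ (mod $p^{k-1}$) we may take $|B| = -q$. Let the unimodular transformation
$U$ taking $g_1$ into $g_2$ be written

$$U = \begin{pmatrix} U_0 & u_1 \\ u_2 & u_{33} \end{pmatrix}$$

where $u_2 = (u_{31}, u_{32}) \equiv (0, 0)$ (mod $p$). We shall first prove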


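LEMMA 1. If $B$ has an automorph $A$ such that
(i) $(A \mp I)B^{-1}$ is integral for proper choice of $\pm$,
(ii) $A \equiv U_0$ (mod $p$),
then an integral 1 × 2 matrix $w$ may be determined so that

$$P = \begin{pmatrix} A & w \\ 0 & \pm 1 \end{pmatrix}$$

and hence $P^{-1}$ are integral automorphs of $G$ and

$$P^{-1}U \equiv \begin{pmatrix} 1 & 0 & \ast \\ 0 & 1 & \ast \\ 0 & 0 & \pm u_{33} \end{pmatrix} \pmod p.$$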
In order to prove this, we need to make $P^TGP = G$, that is

(5) $$\begin{pmatrix} pA^TBA & pA^TBw \pm pA^Tb_1 \\ pw^TBA \pm pb_1^TA & pw^TBw \pm pb_1^Tw \pm pw^Tb_1 + b \end{pmatrix} = \begin{pmatrix} pB & pb_1 \\ pb_1^T & b \end{pmatrix}.$$

But $A^TBA = B$ and, if we can determine an integral $w$ so that

(6) $A^TBw \pm A^Tb_1 = b_1,$

$|P| = \pm 1$ with $|B| \ne 0$ implies that $b$ is equal to the corresponding member in
the left-hand matrix of (5). However (6) is equivalent to

$$BA^{-1}w = \mp (A^T \mp I)b_1,$$

or

$$w = \mp AB^{-1}(A^T \mp I)b_1 = \mp (I \mp A)B^{-1}b_1 = (A \mp I)B^{-1}b_1.$$

Hence $w$ is integral if condition (i) of the Lemma holds. Furthermore, $b_1 \equiv 0$
(mod $p$) implies $w \equiv 0$ (mod $p$).
If, in addition, condition (ii) holds, we have

$$P^{-1} = \begin{pmatrix} A^{-1} & \mp A^{-1}w \\ 0 & \pm 1 \end{pmatrix} \equiv \begin{pmatrix} A^{-1} & 0 \\ 0 & \pm 1 \end{pmatrix} \pmod p,$$

$$P^{-1}U \equiv \begin{pmatrix} A^{-1} & 0 \\ 0 & \pm 1 \end{pmatrix}\begin{pmatrix} U_0 & u_1 \\ 0 & u_{33} \end{pmatrix} \equiv \begin{pmatrix} I & A^{-1}u_1 \\ 0 & \pm u_{33} \end{pmatrix} \pmod p,$$

and our proof is complete. That is, we can, under the conditions of Lemma 1,
find a transformation $U$ taking $g_1$ into $g_2$ for which $u_{21} \equiv u_{31} \equiv 0$ (mod $p$). In
other words, $g_1 \cong g_2$ implies $f_1 \cong f_2$.


It may easily be verified that

(7) $$A = \begin{pmatrix} t - bu & -cu \\ au & t + bu \end{pmatrix}$$

is an automorph of $ax^2 + 2bxy + cy^2$, the form whose matrix is $B$, if $t, u$ is a
solution of $x^2 - qy^2 = 1$, where $-q = ac - b^2$. We prove


LEMMA 2. Condition (i) of Lemma 1 holds if $A$ is expressed in form (7) with
$t \equiv \pm 1$ (mod $q$).
To prove this, note that

$$(A \mp I)B^{-1} = -\frac{1}{q}\begin{pmatrix} c(t \mp 1) & -b(t \mp 1) + qu \\ -b(t \mp 1) - qu & a(t \mp 1) \end{pmatrix},$$

which is integral if $t \equiv \pm 1$ (mod $q$). Notice that any solution of $x^2 - qy^2 = 1$
satisfies the condition if $q$ is an odd prime or double an odd prime.
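Both the automorph (7) and the integrality claim of Lemma 2 can be checked on a small numerical instance. The sketch below uses a hypothetical sample form $2x^2 + 2xy - y^2$ (so $a = 2$, $b = 1$, $c = -1$, $q = b^2 - ac = 3$) and the Pell solution $t = 2$, $u = 1$; these values are chosen for illustration and do not come from the paper. Since $t \equiv -1$ (mod 3), the lower sign applies and the matrix to test is $(A + I)B^{-1}$.

```python
from fractions import Fraction


def matmul(x, y):
    """Product of two matrices given as lists of rows."""
    return [[sum(x[i][k] * y[k][j] for k in range(len(y)))
             for j in range(len(y[0]))] for i in range(len(x))]


def transpose(x):
    return [list(row) for row in zip(*x)]


def automorph(a, b, c, t, u):
    """Matrix (7) for the binary form a x^2 + 2b xy + c y^2."""
    return [[t - b * u, -c * u], [a * u, t + b * u]]


a, b, c = 2, 1, -1                 # hypothetical sample coefficients
q = b * b - a * c                  # q = 3
t, u = 2, 1                        # 2^2 - 3 * 1^2 = 1
B = [[a, b], [b, c]]
A = automorph(a, b, c, t, u)
ATBA = matmul(matmul(transpose(A), B), A)

det_B = a * c - b * b              # equals -q
B_inv = [[Fraction(c, det_B), Fraction(-b, det_B)],
         [Fraction(-b, det_B), Fraction(a, det_B)]]
A_plus_I = [[A[0][0] + 1, A[0][1]], [A[1][0], A[1][1] + 1]]
A_minus_I = [[A[0][0] - 1, A[0][1]], [A[1][0], A[1][1] - 1]]
W_plus = matmul(A_plus_I, B_inv)    # sign matching t = -1 (mod q)
W_minus = matmul(A_minus_I, B_inv)  # the other sign, for comparison
```

Here `ATBA` equals `B`, `W_plus` has integer entries, and `W_minus` does not, which illustrates why the sign in condition (i) must be chosen to match $t$ modulo $q$.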
Now, as may be shown in the same way as one establishes the automorphs
of a binary form,

$$U_0^TBU_0 \equiv B \pmod p$$

implies, for $p$ an odd prime,

$$U_0 \equiv \begin{pmatrix} t' - bu' & -cu' \\ au' & t' + bu' \end{pmatrix} \pmod p$$

where $t'^2 - qu'^2 \equiv 1$ (mod $p$). Hence if there is a solution $t, u$ of the Pell
equation $x^2 - qy^2 = 1$ such that $t \equiv t'$ (mod $p$) we have $qu^2 \equiv qu'^2$ (mod $p$)
and thus by proper choice of sign of $u$ we have $A \equiv U_0$ (mod $p$). We have
proved


LEMMA 3. If for every solution $t', u'$ of the congruence $x^2 - qy^2 \equiv 1$ (mod $p$)
there is a solution $t, u$ of the Pell equation $x^2 - qy^2 = 1$ such that $t \equiv t'$ (mod $p$),
condition (ii) of Lemma 1 holds.


These three lemmas establish Theorem 2. We now consider in more detail 
the conditions (ii) and (iii) of Theorem 2 and investigate the permissible values 
of $p$ and $q$.













4. Modifications of the conditions of Theorem 2. Consider first the condition
that $-q$ be represented by a ternary quadratic form $h$ whose determinant
is prime to $q$. We shall prove


THEOREM 3. If $h$ is an indefinite ternary form satisfying the conditions of
Theorem 1, it represents $-q$ with $(q, |h|) \le 2$ if and only if it represents $-q$ in
$R(2)$, the ring of 2-adic integers, and in $R(r)$ for every odd prime factor $r$ of $\Omega$, that is,
if $h \equiv -q$ (mod $r$) is solvable for every such $r$.


We know from Corollary 44b of [2] that if $h$ represents $-q$ in $R(r)$ for $r = \infty$
and every prime factor, $r$, of $2|h|q$, there is a form $h'$ in the genus of $h$ which
represents $-q$. But our Theorem 1 implies that $h'$ is equivalent to $h$ which
therefore represents $-q$ if $h'$ does. Since $h$ is indefinite it represents $-q$ in the
field of reals. It remains to show that $h$ represents $-q$ in $R(r)$ for $r$ an odd
prime factor of $q|h|$. If $r = q$ or $\tfrac12 q$, Corollary 34b of [2] gives the desired result.
Now for any odd prime $r$ we may consider

$$h \equiv a_1x_1^2 + a_2x_2^2 + a_3x_3^2 \pmod{r^k}.$$

First, if $a_1a_2 \not\equiv 0$ (mod $r$), then

$$a_1x_1^2 + a_2x_2^2 \equiv -q \pmod{r^k}$$

solvable shows that $h$ represents $-q$ in $R(r)$. Second, two of $a_1, a_2, a_3$ are divisible
by $r$ if and only if $r$ divides $\Omega$. Suppose $a_1 \equiv a_2 \equiv 0$ (mod $r$). Then $h = -q$
is solvable in $R(r)$ if and only if $h \equiv -q$ (mod $r$) is solvable [2, Theorem 9a].
This completes the proof.


Since $g$ is a ternary form adj(adj $G$) $= dG$ where $d = |G|$. If $\Omega$ is the g.c.d.
of the 2 × 2 minors of $G$ it divides all elements of $dG$, and $g$ primitive implies
$d = \Omega^2\Delta$, where $\Delta$ is an integer. Furthermore, $d$ is the g.c.d. of all elements of
adj (adj $G$) and hence of all 2-rowed minors of adj $G$. This implies that $\Delta$ is the
g.c.d. of the 2-rowed minors of the matrix of the reciprocal form of $g$. Hence
we have


THEOREM 4. Let $p$ be a fixed odd prime and $f$ a primitive form for which
$\Omega \equiv \Delta \equiv 0$ (mod $p$), neither $\Omega$ nor $\Delta$ being divisible by 4 or $p^2$, and $g$ its $p$-related
form. Then the reciprocal form of $g$ represents $-q$ if and only if it represents it in
$R(r)$ for all prime divisors $r$ of $2\Delta/p$.


This has the effect of imposing on $-q$ certain conditions modulo powers of 2
and mod $r$ for odd prime factors of $\Delta/p$.


COROLLARY. Condition (ii) of Theorem 2 may be replaced by the conditions of
Theorem 4.


Now let us consider further the condition (iii) of Theorem 2. It may be shown
that the number of solutions of the congruence (3) is

$$p - (q|p).$$

The number of solutions with $y \equiv 0$ is 2, with $x \equiv 0$ is $1 + (-q|p)$.
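These counts are easy to confirm by exhaustive search for small primes. The following sketch (function names are hypothetical) enumerates all solutions of $x^2 - qy^2 \equiv 1$ (mod $p$) and also tallies the distinct pairs $(x^2, y^2)$, the quantity counted next in the text.

```python
def legendre(a, p):
    """Legendre symbol (a|p) for an odd prime p, by Euler's criterion."""
    r = pow(a % p, (p - 1) // 2, p)
    return -1 if r == p - 1 else r


def solution_counts(q, p):
    """Brute-force counts for x^2 - q y^2 = 1 (mod p).

    Returns (all solutions, solutions with y = 0, solutions with x = 0,
    number of distinct pairs (x^2, y^2) mod p).
    """
    sols = [(x, y) for x in range(p) for y in range(p)
            if (x * x - q * y * y - 1) % p == 0]
    with_y_zero = sum(1 for x, y in sols if y == 0)
    with_x_zero = sum(1 for x, y in sols if x == 0)
    square_pairs = {(x * x % p, y * y % p) for x, y in sols}
    return len(sols), with_y_zero, with_x_zero, len(square_pairs)
```

For example, `solution_counts(2, 5)` returns `(6, 2, 0, 2)`: here $(2|5) = -1$, so there are $5 + 1 = 6$ solutions, none with $x \equiv 0$, and two distinct pairs of squares.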


Hence the number of solutions with neither $x$ nor $y$ zero is

$$p - (q|p) - (-q|p) - 3$$

and the number of distinct pairs of solutions $x^2, y^2$ with neither zero is one fourth
of this number. Hence the number of distinct (mod $p$) pairs $x^2, y^2$ of solutions is

$$M = \tfrac14\{p - (q|p) + (-q|p) + 3\}.$$

That is

$M = \tfrac14(p + 3)$ if $p \equiv 1$ (mod 4),

$M = \tfrac14(p + 1)$ if $p \equiv -1$ (mod 4) and $(q|p) = 1$,

$M = \tfrac14(p + 5)$ if $p \equiv -1$ (mod 4) and $(q|p) = -1$.

First we consider two special cases. Suppose $p = 3$ and $q \equiv 1$ (mod 3). Then
there is only one pair of solutions of the congruence, namely, $x^2 \equiv 1$, $y^2 \equiv 0$
(mod 3), and hence condition (iii) of Theorem 2 holds. Then from Theorem 4
and Corollary 1 we prove

THEOREM 5. An indefinite primitive ternary quadratic form $f$ is in a genus of
one class provided
(i) $(\Omega, \Delta)$ divides 6,
(ii) $\Omega \not\equiv 0 \not\equiv \Delta$ (mod 4),
(iii) $|f| \not\equiv 0$ (mod 81).


To prove this we need merely show the existence of a prime or double a prime
$q$ with $(q|3) = 1$ and satisfying the conditions of Theorem 4. This means that
$q \equiv 1$ (mod 3) and satisfies certain congruence conditions modulo powers of $r$
where $r$ is a prime factor of $2\Delta/3$. Dirichlet's theorem shows that such a $q$
exists provided that these conditions are consistent and the conditions of the
theorem imply that $\Delta/3$ is not divisible by 3. This completes the proof.


Furthermore, for $p = 3$, $(q|3) = 1$, condition (iii) of Theorem 2 holds even if
$q$ is negative and $g$ a positive form. Thus we have


THEOREM 6. For $p = 3$, a positive ternary quadratic form $f$ is in a genus of only
one class if its 3-related form $g$ is, and if $|f| \not\equiv 0$ (mod 81).


Two examples are

$$f = x^2 + 18y^2 + 3z^2, \qquad g = 3x^2 + 6y^2 + z^2,$$

$$f = x^2 + 18y^2 + 6z^2, \qquad g = 3x^2 + 6y^2 + 2z^2.$$
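The passage from $f$ to its $p$-related form can be checked on these two examples. The sketch below handles diagonal forms only, which is a simplifying assumption; the function name is hypothetical. It applies the substitution $x_1 = py_1$ and divides the result by $p$.

```python
def p_related_diagonal(coeffs, p):
    """p-related form of a diagonal form c1*x1^2 + c2*x2^2 + c3*x3^2.

    Substituting x1 = p*y1 multiplies the first coefficient by p^2; the
    resulting form must then be divisible by p, and the quotient is g.
    """
    c1, c2, c3 = coeffs
    substituted = (c1 * p * p, c2, c3)
    if any(c % p != 0 for c in substituted):
        raise ValueError("form is not of the shape (2) for this prime")
    return tuple(c // p for c in substituted)
```

With $p = 3$, the coefficient triples `(1, 18, 3)` and `(1, 18, 6)` are sent to `(3, 6, 1)` and `(3, 6, 2)`, in agreement with the forms $g$ listed above.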


Group theoretic considerations lead to another special case of interest. Let
$T, U$ be the fundamental solution of $x^2 - qy^2 = 1$. It is well known that all
solutions are given by

$$t_n + u_n\sqrt{q} = \pm (T + U\sqrt{q})^n$$

for integral powers $n$. Hence under this law of combination, the solutions
(mod $p$) of the Pell equation form a multiplicative group $H_p$ which must be a
subgroup of the multiplicative group of solutions of the congruence (mod $p$).
Hence $s$, the order of $H_p$, is a divisor of $2\mu = p - (q|p)$. Condition (iii) of
Theorem 2 will be met if and only if $s = 2\mu$. Now $s$ must be even since $(t, u)$, a
solution of the Pell equation, implies that $(-t, u)$ is a solution and $(0, u)$, a solution,
implies that $(0, -u)$ is. Hence $s = 2s'$. But $s > 2$ unless, for the fundamental
solution, $U \equiv 0$ (mod $p$) and, with this exception, $\mu$ a prime would imply
$s' = \mu$ and $s = 2\mu$. Hence, if for proper choice of sign $\tfrac12(p \pm 1)$ is a prime,
condition (iii) of Theorem 2 holds and $q$ may be chosen to satisfy conditions (i)
and (ii) unless $U \equiv 0$ (mod $p$) for the fundamental solution of the Pell equation.
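The order $s$ of the group of Pell solutions modulo $p$ can be computed directly and compared with $p - (q|p)$. The sketch below is one straightforward way to do so, not the author's procedure: it finds the fundamental solution by direct search, represents a residue of $t + u\sqrt{q}$ by the pair $(t, u)$ mod $p$, and closes the set under multiplication by the generators $T + U\sqrt{q}$ and $-1$. It assumes $q$ is a positive non-square prime to the odd prime $p$; all names are hypothetical.

```python
from math import isqrt


def fundamental_solution(q):
    """Least positive (T, U) with T^2 - q U^2 = 1 (q positive, non-square)."""
    u = 1
    while True:
        t_squared = 1 + q * u * u
        t = isqrt(t_squared)
        if t * t == t_squared:
            return t, u
        u += 1


def legendre(a, p):
    """Legendre symbol (a|p) for an odd prime p, by Euler's criterion."""
    r = pow(a % p, (p - 1) // 2, p)
    return -1 if r == p - 1 else r


def pell_group_order(q, p):
    """Order s of the group of residues mod p of +-(T + U sqrt(q))^n."""
    T, U = fundamental_solution(q)
    generators = [(T % p, U % p), (p - 1, 0)]    # omega and -1

    def multiply(x, y):
        return ((x[0] * y[0] + q * x[1] * y[1]) % p,
                (x[0] * y[1] + x[1] * y[0]) % p)

    seen = {(1, 0)}
    frontier = [(1, 0)]
    while frontier:
        element = frontier.pop()
        for g in generators:
            product = multiply(element, g)
            if product not in seen:
                seen.add(product)
                frontier.append(product)
    return len(seen)


def condition_iii_holds(q, p):
    """True if every solution of the congruence comes from a Pell solution."""
    return pell_group_order(q, p) == p - legendre(q, p)
```

For $p = 5$ the test succeeds for $q = 3$ (where $s = 6$) and fails for $q = 13$, whose fundamental solution $(649, 180)$ has $U \equiv 0$ (mod 5) and so gives $s = 2$.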

To consider the general case we notice again that any solution $t, u$ of
$x^2 - qy^2 = 1$ is expressible in the form

$$t_k + u_k\sqrt{q} = \pm (T + U\sqrt{q})^k$$

where $T, U$ is the fundamental solution. Now

$$t_r + u_r\sqrt{q} \equiv t_s + u_s\sqrt{q} \pmod p$$

implies

$$t_r - u_r\sqrt{q} \equiv t_s - u_s\sqrt{q} \pmod p$$

where if $(q|p) = -1$ by such a congruence we mean that corresponding parts
are congruent and if $(q|p) = 1$ we replace $\sqrt{q}$ by a solution of $q \equiv r^2$ (mod $p$).
Hence $t_r \equiv t_s$, since $p$ is odd and thus $u_r \equiv u_s$.

First, if $(q|p) = 1$, there are $p - 1$ solutions of the congruence and
$\pm (T + U\sqrt{q})^k$ yields all solutions if and only if one of the following holds:

(a) $\omega = T + U\sqrt{q}$ is a primitive root (mod $p$).

(b) $\omega$ belongs to $\tfrac12(p - 1)$ (mod $p$) and no power of $\omega$ is congruent to $-1$
(mod $p$).

We can show that condition (b) may be replaced by

(b') $\omega$ belongs to $\tfrac12(p - 1)$ (mod $p$) and $p \equiv 3$ (mod 4).


Suppose $p \equiv 1$ (mod 4). Then $\omega$ belonging to $\tfrac12(p - 1)$ would imply $\omega^t \equiv -1$
(mod $p$) for $t = \tfrac14(p - 1)$. On the other hand, if $p \equiv 3$ (mod 4), $\omega^t \equiv -1$
(mod $p$) would imply $\tfrac12(p - 1)$ divides $2t$ and since the former is odd it must
divide $t$. This would make it impossible for $\omega$ to belong to $\tfrac12(p - 1)$.

Second, if $(q|p) = -1$ there are $p + 1$ solutions of the congruence and
$\pm (T + U\sqrt{q})^k$ yields all solutions if and only if one of the following holds:

(a) $\omega$ belongs to $p + 1$ (mod $p$).

(b) $\omega$ belongs to $\tfrac12(p + 1)$ (mod $p$) and no power of $\omega$ is congruent to $-1$
(mod $p$).


As above, we may replace condition (b) by


(b') $\omega$ belongs to $\tfrac12(p + 1)$ (mod $p$) and $p \equiv 1$ (mod 4).


5. Examples. We consider $p = 5$ and $p = 7$, giving explicit conditions for
primes $q$ or doubles of primes $q$ satisfying condition (iii) of Theorem 2 and append
a short table of values.


p = 5
Case 1. Suppose $(q|p) = 1$. The primitive roots (mod 5) are 2 and 3. Let
$a^2 \equiv q$ (mod 5) and have

$$T^2 - a^2U^2 \equiv 1 \pmod 5, \qquad T - aU \equiv \pm 2 \pmod 5$$

imply

$$T + aU \equiv \pm 3 \pmod 5$$

and hence

$$T \equiv 0 \pmod 5$$

is the necessary and sufficient condition for (iii) of Theorem 2, since $T^2 \equiv -1$
(mod 5) would imply $a^2U^2 \equiv -2$ (mod 5) which is impossible.
Case 2. Suppose $(q|p) = -1$. Since $p + 1 \equiv 2$ (mod 4) we want $\omega \not\equiv \pm 1$
(mod 5) and $\omega^3 \equiv \pm 1$ (mod 5). Now

$$\omega^2 = T^2 + qU^2 + 2UT\sqrt{q} \equiv 1 \pmod 5$$

only if $UT \equiv 0$ (mod 5). But $T \equiv 0$ (mod 5) would imply $-qU^2 \equiv 1$ (mod 5)
which would deny $(q|p) = -1$. Hence $U \equiv 0$ (mod 5), $T \equiv \pm 1$ (mod 5)
which must be excluded. Thus the necessary and sufficient condition for (iii) is
$T \equiv \pm 2$ (mod 5).

We can include both case 1 and 2 by writing

(8) $T \equiv 0, \pm 2$ (mod 5).

The prime and double prime values of $q$ less than 50 for which (8) holds are:

3, 6, 7,11, 14, 17, 19, 22, 31, 34, 37, 38, 43, 46, 47. 


In terms of our general results this means that $\Omega$ and $\Delta$ may have a common
factor 5 if the negative of one of the numbers in the table is represented by the
reciprocal form of $g$.


p = 7
Case 1. Suppose $(q|p) = 1$. The primitive roots (mod 7) are 3 and 5. Here
we want $\omega^3 \equiv \pm 1$ and $\omega \not\equiv \pm 1$, all congruences being (mod 7). Suppose
$T + aU \equiv \pm 1$; then $T \equiv \pm 1$ which is excluded. Similarly it is easily shown
that $T \equiv 0$ and $T \equiv \pm 2$ are impossible. Hence a necessary and sufficient
condition for (iii) is

$$T \equiv \pm 3 \pmod 7.$$

Case 2. Suppose $(q|p) = -1$. Then $\omega$ must belong to 8 (mod 7), that is,
$\omega^2 \not\equiv \pm 1$. But

$$(T + U\sqrt{q})^2 = T^2 + U^2q + 2TU\sqrt{q} \equiv \pm 1$$

imply $TU \equiv 0$. Thus $U \equiv 0$ and $T^2 \equiv 1$ or $T \equiv 0$ and $qU^2 \equiv -1$, both of
which are excluded. But $T^2 \equiv 9$ is impossible. We include both cases in

(9) $T \equiv \pm 2, \pm 3$ (mod 7).


The prime and double prime values of $q$ less than 50 for which (9) holds are:


3, 5, 6, 10, 11, 13, 17, 19, 23, 26, 37, 38, 41, 43, 46. 
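Both tables can be regenerated from conditions (8) and (9). The sketch below is a brute-force check rather than the author's method; the helper names are hypothetical. It lists the odd primes and doubles of odd primes below 50 that are prime to $p$, finds the fundamental solution $T, U$ of $x^2 - qy^2 = 1$ by direct search, and keeps those $q$ whose $T$ falls in the stated residue classes.

```python
from math import gcd, isqrt


def fundamental_solution(q):
    """Least positive (T, U) with T^2 - q U^2 = 1 (q positive, non-square)."""
    u = 1
    while True:
        t_squared = 1 + q * u * u
        t = isqrt(t_squared)
        if t * t == t_squared:
            return t, u
        u += 1


def is_odd_prime(n):
    return n > 2 and all(n % d for d in range(2, isqrt(n) + 1))


def candidates(limit, p):
    """Odd primes and doubles of odd primes below `limit`, prime to p."""
    return [q for q in range(2, limit)
            if gcd(q, p) == 1
            and (is_odd_prime(q) or (q % 2 == 0 and is_odd_prime(q // 2)))]


def table(p, allowed_residues, limit=50):
    """Values of q whose fundamental T lies in the allowed classes mod p."""
    return [q for q in candidates(limit, p)
            if fundamental_solution(q)[0] % p in allowed_residues]


TABLE_5 = table(5, {0, 2, 3})        # condition (8): T = 0, +-2 (mod 5)
TABLE_7 = table(7, {2, 3, 4, 5})     # condition (9): T = +-2, +-3 (mod 7)
```

The value $q = 2$ is left out because condition (i) of Theorem 2 asks for an odd prime or double an odd prime.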


Extensions of the results of this paper are being considered by the author and 
his students. 


REFERENCES 


1. L. E. Dickson, Studies in the theory of numbers (Chicago, 1930). 
2. B. W. Jones, The arithmetic theory of quadratic forms (Tenth Carus Monograph, Math. 
Assoc. Amer., 1950). 


3. Meyer, Über indefinite ternäre quadratische Formen, J. Reine Angew. Math., vol. 116
(1896), 317-325.





